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METHODS AND COMPOSITIONS FOR 
REGULATED TRANSCRIPTION AND 
• V EXPRESSION OF HETEROLOGOUS GENES 

CROSS REFERENCE TO RELATED 
APPLICATIONS 

This application is a continuation-in-part of U.S Ser No 
07/526,123 filed May 21, 1990 which is a continuation of 
U.S. Ser. No. 07/267,685 filed Nov. 2, 1988 (now 
abandoned) which is a continuation of U.S. Ser. No 06/692 10 
605 filed Jan. 17, 1985 (now abandoned). This application is 
also a continuation-in-part of U.S. Ser. No. 07/582,241 filed 
Sep. 14, 1990, now abandoned, which is a continuation of 
U S^Ser. No. 07/188361 filed Apr. 29, 1988 (abandoned) 
which is a continuation-in-part of U.S. Ser. No. 07/168 190 15 
filed Mar. 15, 1988, (now abandoned) which is a 
continuation-in-part of U.S. Ser. No. 07/054369 filed May 
26, 1987 (issued as U.S. Pat. No. 4,943,674). This applica- 
tion is also a continuation-in-part of U.S. Ser No 07/742 
834 filed Aug. 8, 1991, now U.S. Pat. No. 5,420,034, which 20 
is a continuation-in-part of U.S. Ser. No. 07/550,804 filed 
Jul. 9, 1990, now abandoned, which is a continuation-in-part 
of U.S. Ser. No. 07/147,781 filed Jan. 25, 1988 (abandoned) 
which is a continuation-in-part of U.S. Ser. No 07/078 538 
filed Jul. 28, 1987 (abandoned) which is a continuation-iii- 25 
part of U.S. Ser. No. 06/891 ,529 filed Jul. 31 1986 
(abandoned). ' 

INTRODUCTION 

1. Technical Field 30 
This invention relates to regulated genetic modification of 

plant material, particularly for tissue and/or developmental 
specific transcription and expression. Heterologous con- 
structs are provided whereby production of endogenous 
products can be modulated or new capabilities provided. 35 

2. Background of the Invention 

While the ability to manipulate bacterial and mammon 
cells by hybrid DNA technology has been available for 
almost a decade, only in 1983 was it first reported that 40 
successful expression of an exogenous gene was achieved in 
a plant cell Plants have a highly complex genome and differ 
in numerous ways from both bacterial and mammalian 
genes. Therefore, while as a first approximation, one may 
extrapolate from the experience with other species, the 45 
relevance of such experience must be determined by experi- 
mentation. In general, genetic engineering techniques have 
been directed to modifying the phenotype of individual 
prokaryotic and eukaryotic cells, especially in culture. Plant 
cells have proven more intransigent than other eukaryotic 50 
cells due not only to the lack of suitable vector systems but 
also a result of the different goals involved. Plant genetic 
engineering has for the most part been directed to modifying 
the entire plant or a particular tissue rather than modifying 
a single cell in culture. 55 

In order to be able to successfully modify plant cells, it 
will be necessary to develop a large number of different 
systems for introducing the exogenous DNA into the plant 
cell, for directing, as appropriate, the introduced DNA either 
randomly or to particular genomic sites, to provide for 60 
constitutive or regulated expression and, as appropriate, to 
provide for transport of the product to an appropriate site 
Toward this end, it will be necessary to develop a wide 
variety of regulatory signals involved with replication, 
transcription, translation, integration, and the like. To vary- 65 
ing degrees these regulatory signals will have general appli- 
cation across species or be species-specific, will be associ- 
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ated with specific stages of plant growth, or be subject to 
external control. To that extent, it will be necessary to 
develop a wide spectrum of regulatory sequences to allow 
for expression under predetermined conditions. 

5 For many applications, it will be desirable to provide for 
transcription in a particular plant tissue and/or at a particular 
time in the growth cycle of the plant or maturation cycle of 
the tissue. Toward this end, there is substantial interest in 
identifying endogenous plant products transcription or 

10 expression of which is regulated in a manner of interest. In 
identifying such products, one must first look for a product 
which appears at a particular time in the cell growth cycle or 
in a particular plant tissue, demonstrate its absence at other 
times or in other tissue, identify nucleic acid sequences 

15 associated with the product and then identify the sequence in 
the genome of the plant in order to obtain the 5'-untranslated 
sequence associated with transcription. Identifying the par- 
ticular sequence, followed by establishing that it is the 
correct sequence and isolating the desired transcriptional 

20 regulatory region requires an enormous outlay in time and 
effort. One must then prepare appropriate constructs, and 
demonstrate that the constructs are efficacious in the desired 
manner. 

There has been substantial interest in modifying the seed 
25 with transcriptional initiation regions to afford transcription 
and expression of the gene introduced into the seed, rather 
than constitutive expression which would result in expres- 
sion throughout the plant Also of interest is the ability to 
change the phenotype of fruit, so as to provide fruit which 
30 will have improved aspects for storage, handling, cooking, 
organoleptic properties, freezing, nutritional value, and the 
like. 

In addition, different systems may be required for the 
35 introduction of nucleic acid into plant cells to obtain rea- 
sonable efficiencies of transformation and functioning of the 
nucleic acid. Li many instances, such as the tumor inducing 
plasmids and viruses, the vectors have found limited utili- 
zation in their range of hosts. Therefore, different transfer- 
^ mation and replication systems may be required for different 
plant species. 

Relevant Literature 

Lack of transformation by Agrobacterium of soybean is 
reported by DeCleene and DeLey, The Botanical Review 

45 (1976) 42389-446. Encouraging results in the transforma- 
tion (Pederson et al., Plant CellRepts. (1983) 2:201-204 and 
Hood et al., Bio/Technology (1984) 2:702-708) and regen- 
eration (Christianson et al., Science (1983) 222: 632-634) of 
soybean have recently been reported. A light inducible 

so soybean SSU gene (small subunit SSU) of ribulose-1,5- 
bisphosphate-carboxylase (RuBP-carboxylase) is reported 
by Berry-Lowe et al., J. MoL Appln. Gen. (1982) 1:483-498. 
Sequences 5' to the pSSU gene were recently shown to direct 
foreign gene expression in a light-inducible manner when 

55 transferred into tobacco callus (Herrera-Estrella et at, 
Nature (1984) 310:115-120). 

Crouch et al., In: Molecular Form and Function of the 
Plant Genome, eds. van Vloten-Doting, Groot and Hall, 
Plenum Publishing Corp. 1985, pp 555-566; Crouch and 

60 Sussex, Planta (1981) 153:64-74; Crouch et al., J. MoL 
Appl. Genet (1983) 2:273-283; Simon et al., Plant Molecu- 
lar Biology (1985) 5:191-201; and Scofield and Crouch, J. 
Biol. Chem. (1987) 262:12202-12208, describe various 
aspects of Brassica napus storage proteins. Rose et al., Nucl 

65 Acids Res. (1987) 15:7197 and Scherer and Knauf, Plant 
MoL BioL ( 1987) 9: 127-134 describe ACP genes. Beachy et 
al., EMBOJ. (1985) 4:3047-3053; Sengupta-Gopalan et al., 
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Proc. Natl. Acad. Sci. U.SA. (1985) 82:3320-3324; Green- 
wood and Chrispeels, Plant Physiol. (1985) 79:65-71 and 
Chen et al., Proc. Natl. Acad. Sci. U.S.A. (1986) 
83:8560-8564 describe studies concerned with seed storage 
proteins and genetic manipulation. Eckes et al., Mol. Gen. 5 
Genet (1986) 205:14-22 and Fluhr et al., Science (1986) 
232:1106-1112 describe the genetic manipulation of light 
inducible plant genes. 

CDNA clones from tomato displaying differential expres- 
sion during fruit development have been isolated and char- 10 
acterized (Mansson et al., Mol. Gen. Genet (1985) 
200:356-361; Slater et al., Plant Mol. Biol (1985) 
5: 137-147). The studies have focused primarily on mRNAS 
which accumulate during fruit ripening- One of the proteins 
encoded by the ripening-specific cDNAs has been identified 15 
as polygalacturonase (Slater et al., Plant Mol. Biol. (1985) 
5:137-147). A cDNA clone which encodes tomato polyga- 
lacturonase has been sequenced. Grierson et al., Nucleic 
Acids Research (1986) 14:8395-8603. The concentration of 
polygalacturonase mRNA increases 2000-fold between the 20 
immature-green and red-ripe stages of fruit development 
Hiis suggests that expression of the enzyme is regulated by 
the specific MRNA concentration which in turn is regulated 
by an increase in transcription. Delia Penna et al., Proc. Natl. 
Acad. Sci. U.S.A. (1986) 83:6420^6424. Mature plastid 25 
mRNA for psbA (one of the components of photosystem II) 
reaches its highest level late in fruit development, whereas 
after the onset of ripening, plastid mRNAS for other com- 
ponents of photosystem I and II decline to nondetectable 
levels in chromoplasts. Piechulla et al., Plant Mol Biol 30 
(1986) 7:367-376. 

SUMMARY OF THE INVENTION 

Novel methods and DNA constructs are provided for 35 
transforming plants employing T-DNA and a Ti- or 
Ri-plasmid for heterologous DNA introduction and integra- 
tion into the plant genome. Transformation without gall 
formation of plant cells which have historically not been 
Agrobacterium hosts is achieved with successful expression ^ 
of the heterologous DNA. Additionally, DNA constructs are 
provided which are employed in manipulating plant cells to 
provide for regulated transcription, such as light inducible 
transcription, in a plant tissue or plant part of interest at 
particular stages of plant growth or in response to external 45 
control. Particularly, transcriptional regions from seed stor- 
age proteins, seed coat proteins or acyl carrier protein are 
joined to other than the homologous gene and introduced 
into a plant cell host for integration into the genome to 
provide for seed-specific transcription. The constructs pro- 50 
vide for modulation of expression of endogenous products 
as well as production of exogenous products in the seed. 
Novel DNA constructions also are provided employing a 
fruit-specific promoter, particularly a promoter from a gene 
active beginning at or shortly after anthesis or beginning at 55 
the breaker stage, joined to a DNA sequence of interest and 
a transcriptional terrnination region. A DNA construct may 
be introduced into a plant cell host for integration into the 
genome and transcription regulated at a time at or subse- 
quent to anthesis. In this manner, higji levels of RNA and, as ^ 
appropriate, polypeptides, may be achieved during forma- 
tion and/or ripening of fruit 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is a partial sequence of the promoter region of the 65 
ABnNa napin gene. The start (ATG) of the open reading 
frame is underlined. 
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FIG. 2 is a restriction map of cloned ACGN1-2 showing 
the entire coding region sequence as well as extensive 5' 
upstream and 3' downstream sequences. 

FIG. 3 is a partial nucleotide sequence of genomic ACP 

5 clone Bcg44. The coding region is indicated by the three- 
letter amino acid codes. Breaks in the coding region 
sequence represent introns. The underlined nucleotide at 
position 3 10 is ambiguous without further sequence analysis 
for confirmation. 

10 FIG. 4 is the complete nucleotide sequence of B. campes- 
tris cDNA EA9. The longest open reading frame is desig- 
nated by the three letter amino acid code. PolyA tails are 
evident at the end of the sequence and a potential poiyade- 
nylation signal is underlined. 

15 FIG. 5 shows the nucleotide sequence of the cDNA clones 
pCGN1299 (2A11) andpCGN1298 (3HU).The amino acid 
sequence of the polypeptide encoded by the open reading 
frame is also indicated. 

^ FIG. 6 is a comparison of 2A11 to pea storage proteins 
and other abundant storage proteins: 

(a) 2A11 (residues 33-46) is compared to PAlb and the 
reactive site sequences of some protease inhibitors, 
Pa lb (residues 6-23), chick pea inhibitor (residues 

25 11-23), lima bean inhibitor (residues 23-35), human 
a 1 -antitrypsin reactive site peptide. The arrow indi- 
cates the reactive site. 

(b) is a comparison of the amino terminal sequence of 
2A11 with the amino termini of a range of seed 

30 proteins. Hie data have been modified or deletions 
introduced to maximize homology; conserved residues 
are shown boxed. The sequences are from the following 
sources: PAlb; barley chloroforrn/methanol-soluble 
protein d; wheat albumin; wheat a-amylase inhibitor 
35 0.28; millet bi-functional inhibitor; castor bean 2S 
small subunit; and napin small subunit. 
FIG. 7 shows the complete sequence of the 2A11 genomic 
DNA cloned into PCGN1273 from the Xhol site (position 1 
at the 5* end) to the EcoRI site (position 4654). 
40 FIG. 8 shows the nucleotide sequence of a polygalactu- 
ronase (PG) genomic clone. 

FIG. 9 shows 2A11 genomic constructs. The upper line 
shows a map of the 2A11 genomic clone. The transcriptional 
start site, the polyadenylation site, the start (ATG) and stop 
45 (TGA) sites and the position of the intron are indicated. The 
hatched region indicates the portion of the genomic clone 
that was used to make the tagged 2A11 constructions. The 
bottom portion shows the regions used to construct the 2A11 
cassettes including the synthetic oligonucleotide used to 
so insert restriction sites and reconstruct the 3' end. 

FIG. 1© shows examples of 2A11 cassettes. Four versions 
of the 2A11 cassette are shown. They differ only in the 
flanking poly-linker regions and in the antibiotic resistance 
marker on the plasmid. 

55 

DESCRIPTION OF THE PREFERRED 
EMBODIMENTS 

In accordance with the subject invention, DNA constructs 
are provided which allow for regulated modification of plant 

60 phenotype for example during fruit development and 
ripening, in specific plant structures derived from the ovum, 
and in chloroplast containing plant tissues such as leaves. 
The DNA constructs comprise a regulated transcriptional 
initiation region. Downstream from the regulated transcrip- 

65 tional initiation region will be a sequence of interest which 
will provide for regulated modification of plant phenotype, 
by modulating the production of an endogenous product, as 
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to amount, relative distribution, or the like, or production of 
an exogenous expression product to provide for a novel 
function or product. Thus genes of interest as a source of 
regulated transcriptional initiation regions include those 
genes associated with seed formation, preferably in associa- 5 
tion with embryogenesis and seed maturation and those 
associated with fruit maturation and ripening, fruit rotting 
and light-induced processes in chloroplasts. The transcrip- 
tional cassette will include in the 5'-3' direction of 
transcription, a regulated transcriptional and translation^ 
initiation region, a sequence of interest, and a transcriptional 10 
and translational termination region functional in plants. 
One or more introns may be also present 

In addition to the transcription construct, depending upon 
the manner of introduction of the transcription construct into 
the plant, other DNA sequences may be required. The 15 
subject invention includes a novel method provided for the 
introduction of foreign DNA employing T-DNA from an 
Agrobacterium plasmid, where efficient functional introduc- 
tion of heterologous DNA is achieved in plants normally 
considered outside the Agrobacterium range, e.g., mono- 20 
cotyledons and leguminous dicotyledons, without gall for- 
mation. The method can also be used with the known 
dicotyledon hosts of Agrobacterium. DNA constructs are 
made which can be inserted into an Agrobacterium plasmid 
for transfer to a plant host Plant hosts of particular interest 25 
are the grains and legumes. 

When using the Ti- or Ri-plasmid for transformation of 
plant cells, as described below, at least the right border and 
frequently both the right and left borders of the T-DNA of 
the Ti- or Ri-plasmids will be joined as flanking regions to 30 
the transcription construct The use of T-DNA as a flanking 
region in a construct for integration into a Ti- or Ri- plasmid 
has been described in EPO Application No. 116,718 and 
PCT Application Nos. WO84/02913, 02919 and 02920. See 
also Herrera-Estrella, Nature (1983) 303:209-213; Fraleyet 35 
at, Proc. NatL Acad. Sci, U.S. A. (1983) 80:4803-4807; 
Horsch et al., Science (1984) 223:496-498; and DeBlock et 
al., EMBOJ. (1984) 3:1681-1689. Various fragments may 
be employed in the constructions to provide for homology 
with the T-DNA of the tumor plasmids. The homology may 40 
involve structural genes, promoter regions, other untrans- 
lated regions such as border regions, or the like. 

Downstream from and under the transcriptional initiation 
regulation of the regulatable initiation region will be a 
sequence of interest which will provide for modification of 45 
the phenotype of the specific plant tissue or part Desirably, 
integration constructs may be prepared which allow for 
integration of the transcriptional cassette into the genome of 
a plant host Conveniently, the vector may include a multiple 
cloning site downstream from the regulated transcriptional 50 
initiation region, so that the integration construct may be 
employed for a variety of sequences in an efficient manner. 
The DNA construct will also provide for a termination 
region, so as to provide an expression cassette into which a 
gene may be introduced. Conveniently, transcriptional ini- 55 
tiation and termination regions may be provided separated in 
the direction of transcription by a linker or polylinker having 
one or a plurality of restriction sites for insertion of the gene 
to be under the transcriptional regulation of the regulatory 
regions. Usually, the linker will have from 1 to 10, more 60 
usually from about 1 to 8, rreferably from about 2 to 6 
restriction sites. Generally, the linker will be fewer than 100 
bp, frequently fewer than 60 bp and generally at least about 
5 bp. In conjunction with the subject method these con- 
structs may be used for the introduction of the structural 55 
gene into plant cells in culture, where the cells may be 
regenerated into whole plants. 
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The DNA constructs which are provided employ T-DNA 
flanking regions, flanking a structural gene including tran- 
scriptional and translational regulatory sequences. Thus, the 
construct which includes the structural gene, its transcrip- 
5 tional and translational regulatory controls, and the T-DNA 
flanking regions will for the most part have the following 
formula: 

io wherein: 

T 1 and T 2 are the same or different and are T-DNA from 
a H- plasmid or an Ri- plasmid, where a and b and 0 
or 1, at least 1 of a and b being 1; 
P is a promoter region recognized by a plant host, which 
15 promoter region may include promoters derived fromTi- or 
Ri- plasmids, such as the octopine synthase or nopaline 
synthase promoters, viral promoters, plant promoters, par- 
ticularly leguminous and monocotyledonous plant host pro- 
moters of various structural genes, e.g., RuBP-carboxylase, 
20 more particularly SSU. The promoter region will normally 
include a region for binding of RNA polymerase, as well as 
a cap site. In addition, there may be present enhancers, 
operators, activators, or other regions involved with tran- 
scriptional regulation. The transcriptional initiation region 
25 may be native or homologous to the host or foreign or 
heterologous to the host By foreign is intended that the 
transcriptional initiation region is not found in the wild-type 
host into which the transcriptional initiation region is intro- 
duced. 

30 S.G. intends a structural gene having an open reading 
frame and having at its S'-end an initiation codon and at its 
3'-end one or more nonsense codons. The DNA sequence 
may have any open reading frame encoding a peptide of 
interest, e.g. an enzyme, or a sequence complementary to a 

35 genomic sequence, where the genomic sequence may be an 
open reading frame, an intron, a non-coding leader 
sequence, or any other sequence where the complementary 
sequence will inhibit transcription, messenger RNA 
processing, e.g. splicing, or translation. Tne DNA sequence 

40 of interest may be synthetic, naturally derived, or combina- 
tions thereof. Depending upon the nature of the DNA 
sequence of interest, it may be desirable to synthesize the 
sequence with plant preferred codons. The plant preferred 
codons may be determined from the codons of highest 

45 frequency in the proteins expressed in the largest amount in 
the particular plant species of interest 

Te intends a termination region functional in the plant host 
cell. The termination region, besides including at least one 
terminating sequence, may also include a polyA signal. The 

so termination region which is employed will be primarily one 
of convenience, since the tennination regions appear to be 
relatively interchangeable. The termination region may be 
native with the transcriptional initiation region, may be 
native with the DNA sequence of interest, or may be derived 

55 from another source. Convenient tennination regions are 
available from the Ti-plasmid of A. tumefaciens, such as the 
octopine synthase and nopaline synthase termination 
regions. 

Identifying useful regulated transcriptional initiation 
60 regions may be achieved in a number of ways. For example, 
where a fruit or seed protein has been or is isolated, it is 
partially sequenced, so that a probe can be designed for 
identifying messenger RNA specific for fruit or seed. To 
further enhance the concentration of the messenger RNA 
65 specifically associated with fruit or seed, cDNA can be 
prepared and the cDNA subtracted with messenger RNA or 
cDNA from non-seed or non-fruit associated cells. The 
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residual cDNA can then be used for probing the genome for 
complementary sequences, using an appropriate library pre- 
pared from plant cells. Sequences which hybridize to the 
cDNA then can be isolated, manipulated, and the 
5'-untran slated region associated with the coding region 5 
isolated and used in expression constructs to identify the 
transcriptional activity of the 5'-untranslated region. In some 
instances, a probe may be employed directly for screening a 
genomic library and identifying sequences which hybridize 
to the probe. The sequences will be manipulated as 
described above to identify the S'-untranslated regions. 

As an example, a promoter of particular interest for the 
subject invention, the fruit-specific transcriptional initiation 
region (promoter) from a DNA sequence which encodes a 
protein described as 2 All in the experimental section was 
identified as follows. cDNA clones made from ripe fruit 15 
were screened using cDNA probes made from ripe fruit, 
green fruit, and leaf mRNA. Clones were selected having 
more intense hybridization with the fruit DNAs as con- 
trasted with the leaf cDNAs. The screening was repeated to 
identify a particular cDNA referred to as 2A11. The 2A11 20 
cDNA was then used for screening RNA from root, stem, 
leaf, and seven stages of fruit development after the mRNA 
was sized on gels. The screening demonstrated that the 
particular message was present throughout the seven stages 
of fruit development. The MRNA complementary to the 25 
specific cDNA was absent in other tissues which were tested. 
The cDNA was then used for screening a genomic library 
and a fragment selected which hybridized to the subject 
cDNA. The 5' and 3'non-coding regions were isolated and 
manipulated for insertion of a foreign sequence to be tran- 30 
scribed under the regulation of the 2 All promoter. 

The expression constructs which are prepared employing 
the regulated 5'-untranslated regions may be transformed 
into plant cells as described previously for evaluation of 
their ability to function with a heterologous structural gene 35 
(i.e., a gene other than the open reading frame associated 
with the S'-untranslated region) and specificity of expression 
for example in a particular plant tissue or plant part such as 
leaves, seed or fruit In this manner, specific sequences may 
be identified for use with sequences for fruit or seed-specific 40 
transcription and light-induced transcription. 

Several promoters are of particular interest These include 
the soybean SSU promoter, promoters from genes encoding 
storage proteins and seed embryo genes and those from 
genes that are activated at or shortly after anthesis. The 45 
transcriptional initiation region may be native or homolo- 
gous to the host or foreign or heterologous to the host By 
foreign relative to a particular host is intended that the 
transcriptional initiation region is not found in the wild-type 
host into which the transcriptional initiation region is intro- 50 
duced. Other fruit-specific promoters may be activated at 
times subsequent to anthesis, such as prior to or during the 
green fruit stage, during pre -ripe (e.g., breaker) or even into 
the red fruit stage. 

By use of the soybean SSU promoter, it is found that the 55 
expression of the gene under the SSU promoter can be 
light-induced. Thus, the expression of the gene is 
regulatable, where enhanced expression occurs during irra- 
diation with light, while substantially reduced expression or 
no expression occurs in the absence of light The nucleotide 60 
sequence of the small subunit gene is described by Berry- 
Lowe, /. MoL AppL Gen. (1982) 1:483-498. A Ddel digest 
of a plasmid containing a genomic fragment which includes 
the SSU soybean gene yields a 1.1 kd 5' piece that can be 
used as a promoter fragment 65 

Transcriptional initiation regions from genes encoding 
storage proteins, such as those from genes encoding napin, 
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cruciferin, (3-conglycinin, phaseolin, or the like, and proteins 
involved in fatty acid biosynthesis, such as acyl carrier 
protein (ACP) are also of interest. The transcriptional ini- 
tiation regions may be obtained from any convenient host, 
5 particularly plant hosts such as Brassica, e.g. napus or 
campestris, soybean {Glycine max), bean (Phaseolus 
vulgaris), corn (Zea mays), cotton (Gossypium sp.), saf- 
flower (Carthamus tinctorius) y tomato (Lycopersicon 
esculentum), and Cuphea species. 
io Other transcriptional initiation regions of particular inter- 
est are those associated with seed embryo genes that are 
expressed in the period from about day 7 to day 40, 
particularly those having maximum expression in the period 
from about day 10 to about day 30, postanthesis, and seed 
15 coat genes which are expressed in the period from about day 
1 1 to day 30. Usually the period of expression will be at least 
3 days, more usually about 7 days and may be substantially 
over the entire period. 

Also of interest is a transcriptional initiation region which 
20 is activated at or shortly after anrhesis, so that in the early 
development of the fruit, it provides the desired level of 
transcription of the sequence of interest. Normally, the 
sequence of interest will be involved in affecting the process 
in the early formation of the fruit or providing a property 
25 which is desirable during the growing (expansion) period of 
the fruit, or at or after harvesting. 

The ripening stages of the tomato may be broken down 
into mature green, breaker, turning, pink, light red and red. 
Desirably, the transcriptional initiation region maintains its 
30 activity during the expansion and maturation of the green 
fruit, more desirably continues active through the ripening 
or red fruit period. Comparable periods for other fruit are 
referred to as stages of ripening. The invention is not limited 
to those transcriptional initiation regions which are activated 
35 at or shortly after anthesis but also includes transcriptional 
initiation regions which are activated at any of the ripening 
stages of the fruit. An example of a fruit-specific transcrip- 
tional initiation region is the one referred to as 2 A 11 which 
regulates the expression of a 2 All cDNA sequence 
40 described in the Experimental section. The 2 All transcrip- 
tional initiation region provides for an abundant messenger, 
being activated at or shortly after anrhesis and remaining 
active until the red fruit stage. The expressed protein is a 
sulfur-rich protein similar to other plant storage proteins in 
45 sulfur content and size. 

Also of interest is a transcriptional initiation region which 
regulates expression of the enzyme polygalacturonase, an 
enzyme which plays an important role in fruit softening 
and/or rotting. The polygalacturonase promoter is active in 
50 at least the breaker through red fruit stage in tomato fruit 
Any structural gene of interest may be employed for use 
in the construct In many instances, it will be desirable to 
have another structural gene to serve as a marker associated 
with the construct, so that one can detect those plant cells in 
55 which the foreign gene has been stably introduced. For the 
most part, these constructs will have the following formula: 

(T*) 0 — P l — (S.G.) 1 — Tk 1 ) — (P 2 — (S.G.) 2 — Tfe 2 )— (T 4 )* 

wherein: 

6b all of the symbols have the same functional definition 
except that the superscripts for P and Te intend that the 
promoter and terminator regions may be the same or 
different, where one is a marker and the other is a structural 
gene of interest. Of course, one may provide for a string of 

65 expression constructs having a plurality of the same or 
different genes in the construct. Thus, the presence of only 
two genes flanked by the T-DNA is merely illustrative. 
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As markers for structural genes, one can employ antibi- 
otic resistance genes, e.g., a kanamycin resistance gene or 
methotrexate resistance gene (DHFR). These genes are 
described in Haas and Dow ding, supra. Other markers 
include resistance to a biocide, particularly an antibiotic, 5 
such as G418, bleomycin, hygromycin, chloramphenicol, or 
the like. The particular marker employed will be one which 
will allow for selection of transformed cells as compared to 
cells lacking the DNA which has been introduced. 

The structural gene of interest may be any gene, either 10 
native, mutant native, or foreign to the plant host, and may 
be provided in a sense or antisense orientation. For native 
and mutant genes, the gene may provide for increased 
capability of protein storage, improved nutrient source, 
enhanced response to light, enhanced dehydration 15 
resistance, e.g., to heat, salinity or osmotic pressure, herbi- 
cide resistance, e.g., glyphosate, or the like. Foreign genes 
may include enhancement of native capabilities, herbicide 
resistance, resistance to various pests, such as viruses, 
insects, bacteria or fungi, production of foreign products, as 20 
a result of expression of one or more foreign genes, or the 
like. 

In preparing the cassette construct, the various DNA 
sequences may normally be inserted or substituted into a 
bacterial pi as mid. Any convenient plasmid may be 25 
employed, which will be characterized by having a bacterial 
replication system, a marker which allows for selection in 
the bacterium and generally one or more unique, conve- 
niently located restriction sites. These plasmids, referred to 
as vectors, may include such vectors as pACYC184, 30 
pACYC177, pBR322, pUC9, the particular plasmid being 
chosen based on the nature of the markers, the availability 
of convenient restriction sites, copy number, and the like. 
Thus, the sequence may be inserted into the vector at an 
appropriate restriction site(s), the resulting plasmid used to 35 
transform the E. coli host, the E. coli grown in an appropriate 
nutrient medium and the cells harvested and lysed and the 
plasmid recovered. One then defines a strategy which allows 
for the stepwise combination of the different fragments. 

As necessary, the fragments may be modified by employ- 40 
ing synthetic adapters, adding Linkers, employing in vitro 
mutagenesis or primer repair to introduce specific changes in 
the sequence, which may allow for the introduction of a 
desired restriction site, for removing superfluous base pairs, 
or the like. By appropriate strategies, one desires to mini- 45 
mize the number of manipulations required as well as the 
degree of selection required at each stage of manipulation. 
After each manipulation, the vector containing the manipu- 
lated DNA may be cloned, the clones containing the desired 
sequence isolated, and the vector isolated and purified. As 50 
appropriate, hybridization, restriction mapping or sequenc- 
ing may be employed at each stage to ensure the integrity 
and correctness of the sequence. 

The cassette constructs may be introduced into the plant 
host cell in a variety of ways, such as an insertion into a 55 
tumor- or gall-producing plasmid, as bare DNA, as an 
insertion in a plant DNA virus, using A. tumefaciens or A. 
rhizogenes as the transforming agent, protoplast fusion, 
injection, electroporation, etc. For transformation with 
Agrobacterium, plasmids can be prepared in E. coli which 60 
plasmids contain DNA homologous with the Ti-plasmid, 
particularly T-DNA. The plasmid may or may not be capable 
of replication in Agrobacterium, that is, it may or may not 
have a broad spectrum prokaryotic replication system, e.g., 
RK290, depending in part upon whether the transcription 65 
construct is to be integrated into the Ti-plasmid or be 
retained on an independent plasmid. By means of a helper 
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plasmid, the transcription construct may be transferred to the 
A. tumefaciens and the resulting transformed organism used 
for transforming plant cells. 
Alternatively, to enhance integration into the plant 

5 genome, terminal repeats of transposons may be used as 
borders in conjunction with a transposase. In this situation, 
expression of the transposase should be inducible, so that 
once the transcription construct is integrated into the 
genome, it should be relatively stably integrated and avoid 

10 hopping. 

Conveniently, explants may be cultivated with A. tume- 
faciens or A rhizogenes to allow for transfer of the expres- 
sion cassette to the plant cells, the plant cells dispersed in an 
appropriate selective medium for selection, grown to callus, 

15 shoots grown and planttets regenerated from the shoots by 
growing in rooting medium. The Agrobacterium host will 
contain a plasmid having the vir genes necessary for transfer 
of the T-DNA to the plant cells and may or may not have 
T-DNA. For injection and electroporation, disarmed 

20 Ti-plasmids (lacking the tumor genes, particularly the 
T-DNAregion) may be used to introduce genes into the plant 
cell. 

In accordance with the subject invention, an efficient 
procedure is provided for introduction of foreign DNA into 
25 plant cells with integration of the DNA and without gall 
formation, particularly as to plants which previously have 
been reported to be outside the host range of Agrobacteriuin. 
For a list of plant genera and species which are hosts and 
non-hosts for Agrobacterium, see De Qeene and De Ley 
30 The Botanical Review (1976) 42389-466. Of particular" 
interest in the subject invention are dicotyledon legumes, 
such as soybean, and monocotyledon grains, such as corn' 
rice, wheat, barley and oats. 

Where a tumor- or gall-producing plasmid, e.g., the Ri-or 
35 H-piasmid, is to be used to introduce the cassette into the 
plant cell, a binary plasmid, which includes an Agrobacte- 
rium functional replication system, or bacterial mating may 
be employed, whereby the cassette-carrying plasmid is 
transferred from a compatible bacterium to A rhizogenes or 
40 A. tumefaciens and the transconjugant isolated and analyzed 
for integration of the cassette into the Ri- or Tl-plasmid. This 
can be readily determined by various techniques, such as 
Southern analysis. 
The Ti- or Ri-plasmid which is employed should be 
45 capable of providing for integration of T-DNA in the host 
without observable symptoms of tumor or gall formation. 
Thus, the plasmid which is selected may be tumor-producing 
in a conventional host, but will not produce tumors in plants 
normally considered not to be hosts. An ulustrative plasmid 
50 is P TiA6, a wild-type plasmid. The A rhizogenes or A. 
tumefaciens bacteria containing the cassette and the Ri- or 
Ti-plasmid may now be used for transformation of a plant 
host cell. 

For transformation particularly of monocotyledenous or 
55 leguminous plants, the subject method employs in vitro 
grown seedlings between green V-E and V-l (Fehr and 
Caviness, 1977, Stages of Soybean Development Iowa 
State Coop. Ext. Serv., Agric. and Home Econ. Expt. Stn. 
Special Report 80). Thus, young plants, the hypocotyl or 
60 next leaf are employed. The Agrobacterium cells are injected 
into the plant tissue. Generally about 1-5 ul of lxlO 6 to 
lxl0 8 cells/ml will be injected. Injection of Agrobacterium 
into cotyledons, nodes and internodes causes a visible necro- 
sis around the wound site. No tumor formation is observed. 
65 After about one to three weeks, the explants are excised 
from the tissue surrounding the site of injection and subcul- 
tured in a hormone lacking medium. Callus is observed to 
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grow from some of the explants. Opine is present in these 
tissues, while none is detected in non-transformed callus. 

Transformation of seed crops such as Brassica can be by 
any of a variety of methods known to those skilled in the art. 
See, for example, Radke et al. (1988) Theor. Appl. Genet. 5 
75:685-694 and Radke et al. (1992) Plant Cell Reports 
11:499-505. 

The cells which have been transformed may be grown 
into plants in accordance with conventional ways. See, for 
example, McCormick et al., Plant Cell Reports (1986) 10 
5:81-84. These plants may then be grown, and either pol- 
linated with the same transformed strain or different strains, 
and the resulting hybrid having the desired phenotypic 
characteristic identified. Two or more generations may be 
grown to ensure that the subject phenotypic characteristic is 15 
stably maintained and inherited and then fruits or seeds 
harvested to ensure the desired phenotype or other property 
has been achieved. 

As a host cell, any plant variety may be employed which 
provides a plant part or tissue of interest. For example, 20 
where the plant tissue of interest is seed, for the most part, 
plants will be chosen where the seed is produced in high 
amounts or a seed-specific product of interest is involved. 
Seeds of interest include the oil seeds, such as the Brassica 
seeds, cotton seeds, soybean, safflower, sunflower, or the 25 
like; grain seeds, e.g., wheat, barley, rice, clover, corn, or the 
like. 

Where the plant part is a fruit, any of a number of fruit 
bearing plants may be employed in which the plant parts of 
interest are derived from the ovary wall. These include true 30 
berries such as tomato, grape, blueberry, cranberry, currant, 
and eggplant; stone fruits (drupes) such as cherry, plum, 
apricot peach, nectarine and avocado; compound fruits 
(droplets) such as raspberry and blackberry. In hesperidium 
(oranges, citrus), the expression cassette might be expected 35 
to be expressed in the "juicy** portion of the fruit In pepos 
(such as watermelon, cantaloupe, honeydew, cucumber and 
squash) the equivalent tissue for expression is most likely 
the inner edible portions, whereas in legumes (such as peas, 
green beans, soybeans) the equivalent tissue is the seed pod. 40 

By use of transcription initiation regions from regulated 
genes, it is found mat expression of a structural gene of 
interest, either sense or antisense, can be regulated in a 
manner similar to the regulation of the gene native to the 
transcription initiation region. For example, by use the 45 
soybean SSU promoter, the expression of a gene under the 
control of this promoter is induced by light Thus, the 
expression of the gene is regulatable, where enhanced 
expression occurs during irradiation with light, while sub- 
stantially reduced expression or no expression occurs in the so 
absence of light. Similarly, transcription initiation regions 
from genes expressed preferentially in seed or fruit tissues 
may be used to control of expression of desired DNA 
sequences in these plant tissues. 

By virtue of having a regulatable promoter in the soybean 55 
plant, one can provide for protection against herbicides, by 
providing a herbicide-resistant gene to be under the regu- 
latable control of the SSU promoter. For example, by 
employing a mutated aroA gene, the enzyme 5-enolpyruvyl- 
3-phosphoshikimate synthase which is. glyphosate-resistant 60 
can be produced under light induction. Thus, the soybean 
plant may be protected from glyphosate, allowing for the 
killing of weeds employing the glyphosate herbicide. While 
glyphosate may be used by itself, particularly for pre- 
emergent spraying and post-emergent control of weeds, the 65 
glyphosate may also be used with other post-emergent 
broadleaf herbicides, such as Basagran (bentazan), Tackle/ 
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Blazer (aciflurofen). Normally, applications will vary from 
about 1.25 to 1.5 lbs/acre, where the herbicides may be 
formulated as dry or wet formulations, by themselves or in 
combination with other additives, such as sticking agents, 

5 spreading agents, stabilizers, or the like. Inert powders may 
be used with dry formulations. 

A transcriptional initiation region may be used for varying 
the phenotype of the seeds. Various changes in phenotype 
are of interest These include modifying the fatty acid 

10 composition in seeds, that is changing the ratio and/or 
amounts of the various fatty acids, as to length, unsaturate on, 
or the like. Thus, the fatty acid composition may be varied 
by enhancing the fatty acids of from 10 to 14 carbon atoms 
as compared to the fatty acids of from 16 to 18 carbon atoms, 

15 increasing or decreasing fatty acids of from 20 to 24 carbon 
atoms, providing for an enhanced proportion of fatty acids 
which are saturated or unsaturated, or the like. These results 
can be achieved by providing for reduction of expression of 
one or more endogenous products, particularly enzymes or 

20 cofactors, by producing a transcription product which is 
complementary to the transcription product of a native gene, 
so as to inhibit the maturation and/or expression of the 
transcription product, or providing for expression of a gene, 
either endogenous or exogenous, associated with fatty acid 

25 synthesis. Expression products associated with fatty acid 
synthesis include acyl carrier protein, acyl-ACP 
thioesterase, acetyl-CoA ACP transacylase, acetyl-CoA 
carboxylase, ketoacyl-ACP synthases, malonyl-CoA ACP 
transacylase, stearoyl-ACP desaturase, and other desaturase 

30 enzymes. 

A transcriptional initiation region may be employed for 
varying the phenotype of the fruit. Various changes in 
phenotype are of interest These changes may include up- or 
down-regulation of formation of a particular saccharide, 

35 involving mono- or polysaccharides, involving such 
enzymes as polygalacturonase, levansucrase, 
dextransucrase, invertase, etc; enhance lycopene biosynthe- 
sis; cytokinin and monellin synthesis. Other properties of 
interest for modification include response to stress, 

40 organisms, herbicides, bruising, mechanical agitation, etc., 
change in growth regulators, organoleptic properties, eta 
For antisense or complementary sequence transcription, the 
sequence will usually be at least 12, more usually at least 16 
nt. Antisense sequences of interest include those of 

45 polygalacturonase, sucrose synthase and invertase. 

Alternatively, one may provide various products from 
other sources including mammals, such as blood factors, 
lymphokines, colony stimulating factors, interferons, plas- 
minogen activators, enzymes, e.g. superoxide dismutase, 

so chymosin, etc., hormones, rat mammary thioesterase 2, 
phospholipid acyl desaturases involved in the synthesis of 
eicosapentaenoic acid, and human serum albumin. The level 
of seed proteins, particularly mutated seed proteins, having 
an improved amino acid distribution which would be better 

55 suited to the nutrient value of the seed can also be increased. 
This can be achieved, for example, by inhibition of the 
native seed protein by producing a complementary DNA 
sequence to the native coding region or non-coding region, 
where the complementary sequence does not hybridize 

60 efficiently to the mutated sequence, or inactivates the native 
transcriptional capability. 

A protein is provided having the sequence described in the 
Experimental section designated as 2A11. This protein could 
be a storage protein and be useful in enhancing sulfur 

65 containing amino acids (cysteine and methionine) in the 
diet. It can be obtained in substantially pure form by 
providing for expression in prokaryotes or eukaryotes, e.g.. 
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yeast by inserting the open reading frame into an expression 
cassette containing a transcriptional initiation region. A 
variety of expression cassettes are commercially available or 
have been described in the literature. See, for example, U.S. 
Pat. Nos. 4,532,207; 4,546,082; 4,551,433; and 4,559302. 5 
The product, if intracellular, may be isolated by lysing of the 
cells and purification of the protein using electrophoresis, 
affinity chromatography, HPLC extraction, or the like. The 
product may be isolated in substantially pure form free of 
other plant products, generally having at least about 95% 10 
purity, usually at least about 99% purity. 

The following examples are offered by way of illustration 
and not by limitation. 

EXAMPLES 15 

Cloning Vectors Cloning vectors used include the pUC 
vectors, pUC8 and pUC9 (Vieira and Messing, Gene (1982) 
19:259-268); pUC18 and pUC19 (Norrander et al. Gene 
(1983) 26:101-106; Yanisch-Perron et al., Gene (1985) 
33:103-119), and analogous vectors exchanging chloram- 20 
phenicol resistance (CAM) as a marker for the ampicillin 
resistance of the pUC plasmids described above (pUC-CAM 
[pUC12-Cm, pUC13-Cm] Buckley, IC, Ph.D. Thesis, 
U.C.S.D., CA 1985). The multiple cloning sites of pUC18 
and pUC19 vectors were exchanged with those of pUC- 25 
CAM to create pCGN565 and pCGN566 which are CAM 
resistant Also used were pUC118 and pUC119, which are 
respectively, pUC18 and pUC19 with the intergenic region 
of M13, from an HgiAI site at 5465 to the AhalU site at 
5941, inserted at the Ndel site of pUC (available from Vieira 30 
J. and Messing, J. Waksman Institute, Rutgers University, 
Rutgers, NJ.) 

Materials Terminal deoxynucleotide transferase (TDTh 
RNaseH, E. coli DNA polymerase, T4 kinase, and restriction 
enzymes were obtained from Bethesda Research Laborato- 
ries; E. coli DNA ligase was obtained from New England 
Biolabs; reverse transcriptase was obtained from Life 
Sciences, Inc.; isotopes were obtained from Amersham; 
X-gal was obtained from Bachem, Inc., Torrance, Calif. ^ 

Bacterial strains, plasmids, and media E. coli strains 
MM294(F endAl hsdR17 supE44 thTl) (Meselson and 
Yuan, Nature (1968) 217:1110-1114) and 71-18 (Alac- 
proAB) supE thi FlacF Z M15 proA^B*) (Messing et al., 
Proc. Natl. Acad Sci. U.SA. (1977) 74:3642-3646) were 45 
routinely used for transformations. A. tumefacicns A348 
contains the octopine Ti-plasmid pTiA6 in A114 (Garfinkel 
and Nester, J. BacterioL (1980) 144:732-743). pRK2073 
was maintained inHBlOl (F- hsd'20 (r fl -r m -) xccA13 proA2 
lacYl leuB6 rpsL20 thi 1 supE44) (Boyer and Rouland- 50 
Dussiox, J. Mol. Biol (1969) 41:459). 

Plasmid pRK2073 was generated by insertion of Tn7 into 
the Kan r gene of pRK2013. (Ditta et al., Proc. Natl. Acad 
Sci. USA. (1980) 76:1648-1652) pSR2.1 (Berry-Lowe et 
al., 1982, supra.) contains a 2. 1 EcoRI fragment of a soybean 55 
small subunit gene (SSU) in pBR325. The Baml9 fragment 
of pTiA6 was maintained as a 4.6kb subclone in pBR325 
(pNM33C-19-l) (Thomashow et al., Cell (1980) 
19:729-739). pCGN464 contained the 1.5 kb HindHI-SalsI 
fragment of Tn5 cloned into the sp6 transcription vector 60 
pSP65 (Melton et al., Nucl. Acids Res. (1984) 
12:7035-7056). The pUC7 recombinant vector containing 
the 1.0 kb BglE-Smal fragment of Tn5is designated 
pCGN546. 

E. coli were grown on LB media (Miller, 1972, Experi- 65 
ments in Molecular Genetics, CSH Laboratory, Cold Spring 
Harbor, N.Y.). A. tumefaciens were grown in either minimal j 
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AB medium (Chilton et al., Proc. Natl. Acad. Set. U.S.A. 
(1974) 71:3672-3676) or in MG/L (50% LB:50% mannitol 
glutamate medium (Roberts and Kerr, Physiol. Plant Pathol 
(1974) 4:81-91. 

5 E. coli strain pCGN1299x7118 was deposited with the 
American Type Culture Collection (A.T.C.C.), 12301 
Parklawn Drive, Rockville, Md., 20852 on May 21, 1987 
and given Accession No. 67408. 

io Example 1 

Preparation of Transformed Soybean Plants 
Soybean (glycine max cv "forrest") seeds were surface 
sterilized (12 min, 5% sodium hypochlorite, 0.1% Tween 

15 80), washed 3 times in distilled water and germinated 
aseprtically (1/10 MS-Gibco, 0.6% phytagar (Gibco) 
medium without hormones, 25° C. red light (Grolux 40W)). 
Agrobacterium containing strains pTiCGN327 and 
pTiCGN609 were grown overnight (MG/L medium 30° C.) 

20 were injected into hypocotyl, cotyledons, node and intern- 
ode of two to three week old seedlings. Three weeks after 
injection, tissues surrounding the injection site were excised 
and placed on 0.6% phytagar MS medium deprived of 
hormones and containing 0.5 g/L carbenicillin. Hormone 

25 independent, octopine positive tissues were then transferred 
to liquid MS medium and analyzed for the presence of 
octopine (Otten and Schilperoot, Biochenu etBiophys. Acta 
1978) 527:497-500). To determine kanamycin resistance, 
growing calli were then placed in light or complete darkness. 

30 Friable calli of light grown or dark grown 327 and 609 were 
disaggregated by filtering through a 105 u nylon mesh. 
Samples (0.1 ml packed cell volume (p.c.v.)) of fine sus- 
pensions (1-15 cells/clump were placed in the same medium 
containing 0 to 300 mg/L kanamycin- Pigmented cells were 

35 kept in the light while the non-pigmented cells were kept in 
total darkness. The effects of kanamycin on growth were 
measured as packed cell volume six weeks later. 
DNA Isolation 

The alkali-lysis procedure of Ish-Horowitz (Maniatis et 
40 aL, 1982 Molecular Cloning, A Laboratory Manual CSH 
Laboratory, Cold Spring Harbor, N.Y.) was used for both 
large-scale plasmid isolation and for mini-prep analysis. 
Total DNA from A. tumefaciens was prepared as described 
(Currier and Nester, /. BacterioL (1976) 126:157-165. 
45 DNA fragments were isolated from low melt agarose gels 
(Sea Plaque) run in TAE buffer (0.04M Tris-acetate, 0.002M 
EDTA (Maniatis, supra.) without ethidium bromide. The 
desired fragment was extracted from the excised agarose 
band by melting at 65° C. for 30min followed by phenol 
50 extraction and ethanol precipitation. 
Cloning Procedures 

Restriction enzyme digestions and ligations were per- 
formed according to manufacturer's instructions. Klenow 

55 fill-in reactions and transformation were as described 
(Manniatis, supra.) When pUC9 was being transformed into 
E. coli strain 71-18, X-Gal and HTG were added to the 
plates as described (Miller, 1972, supra). Correct insertion 
and orientation of recombinants were verified by 2 to 3 

^ restriction digests. 

The verification of the Smal-Ddel junction in 0CGN6O6 
was done by cloning the l.lkb BamHI-EcoRI fragment into 
M13mp9 (Maniatis, supra). Sequence analysis was then 
performed in accordance with conventional ways. 

65 Agrobacterium matings 

The pCGN609 construct was integrated into the plas- 
midTi pTiA6 in a three-way mating (Comai et al., 1983, 
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Plasmid 10:21-30). Overnight E. Coli strains containing 
cultures of pCGN609 and pRK2073, respectively, were 
mixed with A. tumefaciens strain A722 and spread on AB 
plates containing 150 jig/ml kanamycin and 250 ug/ml 
streptomycin. Single colonies were restreaked twice. Cor- 5 
rect integration was verified by Southern analysis of total 
Agrobacterium DNA. BamHI digested DNA was probed 
with a nick-translated 2.5 Pstl-EcoRI 3' ocs fragment from 
pCGN607. Southern analysis and nick translation were 
performed in accordance with conventional ways. 10 
RNA preparation and Northern blot analysis 
RNA was prepared from soybean callus by a modification 
of the guanidine thiocyanate procedure of Colbert et al. 
{Proc. Natl. Acad Sci. U.S.A. (1983) 80:2248-2252) in 
which the extraction buffer contain 4M guanidine 15 
thiocyanate, 2% lauryl sarcosine, 1% p-raercaptoethanol, 50 
mM Tris, pH 7.5 20 mM EDTA, 1 mM aurintricarboxylic 
acid, 0.4% antif oam A (Sigma). PolyA + RNA was purified 
over oligo-dT cellulose (Maniatis, supra) and Northern gels 
run as previously described (Shewrnaker et al., 1984, supra). 20 
32 p-RNA bacterial aminoglycoside phosphotransferase 
rnRNA complementary to (APH(3 , )II-mRNA) (Herrera- 
Estrella et al., EMBO J. (1983) 2:987-995; Bevan et al., 
Nature (1983) 304:184-187) was synthesized from Bgin cut 
pCGN464 using a riboprobe kit (RroMega Biotech) accord- 25 
ing to the manufacturer's instructions. The hybridization 
buffers were as suggested by the riboprobe manufacturer's 
with hybridization at 55° C. and washes at 60° C 

Kanamycin activity blots ^ 
The kanamycin activity blots (Reiss et al.. Gene (1984) 
30:211) were performed as modified for plants (Schreier et 
al., EMBO J. (1985) 4(l):25-32). For each sample, 0.2 g of 
fresh soybean callus was used. 

Construction of soybean ssu-Kan r chimera 35 
A soybean SSU gene (Berry-Lowe), 1982, supra) was 
chosen as the source of the 5 -promoter region. In this gene 
there is a Ddel site, 9 pb upstream of the AUG. A Ddel digest 
of pSRS2.1 (Berry-Lowe, 1982, supra.) yielded a 1.1 kb 5* 
fragment isolated out of a low melt agarose geL The 5' 1.1 40 
kb Ddel fragment was filled in with Klenow polymerase and 
ligated into Smal digested pUC9 (Vieira and Messing, Gene 
(1982) 19:259). A clone, pCGN606 was obtained that had 
the SSU promoter facing the adjacent EcoRI site of pUC9. 

A cassette containing the soybean 5* region and an appro- 45 
priate 3* region was men constructed. For this cassette, the 
octopine synthetase (ocs) 3' region was chosen as a 2.5 kb 
EcoRI-PstI fragment from a Baml9 subclone of pTiA6 
(Thomashow, 1980, supra). Since it contained regions 
homologous to T-DNA, it would facilitate transfer to the 50 
Uplasmid of Agrobacterium, The cassette pCGN607 was 
obtained in a 3- way ligation with this fragment, the 1.1 kb 
BamHI EcoRI 5* soybean SSU fragment from pCGN606, 
and the 2.7 kb BamHI-PstI fragment of pACYC177 (Chang 
and Cohen, J. Bacterial. (1978) 134:1141). 55 

The APH(3>II gene employed was from Tn5, which 
confers resistance to kanamycin both in bacteria (Haas and 
Dowding, MetK Enzymology (1975) 43:611-628) and plants 
(Herrera-Estrella, 1983, supra). A 1.0 kb Bglll-Smal frag- 
ment containing the gene was cloned into pUC7 resulting in 60 
adjacent flanking EcoRI restriction sites. The plasmid was 
digested to provide a 1.0 kb EcoRI fragment and this 
fragment ligated into EcoRI digested pCGN607. Clones 
were screened for those carrying the Kan r gene of Tn5 in the 
correct orientation. One of the clones which had the correct 65 
orientation was designated pCGN609. The plasmid also 
carried the kanamycin resistance gene from pACYC177 
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(APH(3')-I as a bacterial marker. These two kanamycin 
resistance genes (APH(3')-I and -II) do not cross-hybridize 
at the nucleic acid level. 
Following Klenow polymerase fill-in, only 9 bp which are 
5 present upstream of the AUG in native soybean SSU are 
lacking in pCGN609. These 9 bp are replaced with 46 bp 
that arise from the fusion manipulations. The rest of the 1.1 
kb soybean SSU 5 'region is the same in pCGN609 as in 
native soybean. 

10 The integration of pCGN609 into the Ti-plasmid pTiA6 
was accomplished in a three-way (Comai et al., 1983) 
mating with pRK2073. Correct integration was verified by 
Southern analysis of the resulting Agrobacterium, desig- 
nated pTiCGN609. In the integration an intact octopine 
synthetase region is maintained as evidenced by the detec- 
tion of octopine. Octopine was detected by fluorescence of 
its phenanthroquinone adduct following paper electrophore- 
sis of tissue extracts (10 nig). 

20 Transformation of soybean 

Transformation of soybean was performed on in vitro 
grown seedlings from the time their cotyledons turned green 
up to the time of the appearance of the second intemode. In 
every case, the injection of Agrobacterium caused a clearly 

25 visible necrosis around the wound site. Occasionally, after 1 
to 3 weeks, roots would appear at the inoculation site. 
Splitting also occurred, revealing swollen tissue, but in no 
case was tumor noted with the Agrobacterium strains used. 
Explants excised from the tissue surrounding the site of 

30 injection were subcultured in MS medium deprived of 
hormones, 0.6% phytagar, 0.5 g/L carbenicillin. Hormone- 
independent callus grew from some of the ex plants. 
Hormone-independent growing tissue analyzed for the pres- 
ence of octopine was positive, while no octopine was 

35 detected in non-transformed soybean tissue. All aerial parts 
of the soybean seedlings, cotyledons, internodes, and nodes, 
were able to produce transformed tissue although no sys- 
tematic study was done to determine which of these areas is 
most susceptible to Agrobacterium. 

40 Analysis of PolyA + RNA in light and dark crown tissue 

The increase in SSU protein seen in a number of light 
grown plants was shown to correlate with an increase in the 
level of SSU polyA + RNA. Northern analysis of light and 
dark grown 609 soybean callus was performed to determine 
45 if an increase in APH(J)-II polyA + RNA occurred with 
growth in light The results were determined with a 32 P- 
RNA probe specific for APH(3')-II transcript in the sense 
orientation. An RNA of the expected size of approximately 
1.6 kb was seen in both cases of light and dark. Approxi- 
mately 5-10 times as much transcript was seen in the light 
grown tissue as the dark grown tissue. 

Presence of protein with kanamycin phosphotransferase 
activity 

55 APH(3>n (aminoglycoside phosphotransferase) inacti- 
vates kanamycin by phosphorylation. The presence of this 
activity can be demonstrated by a number of assays which 
measure the phosphorylation of kanamycin in vitro. In the 
assay employed (Reiss et al., Gene (1984) 30:211) extracts 

60 are run on an acrylamide gel, reacted in situ with kanamycin 
and y- 32 P-ATP and then blotted to P81 (phosphocellulose) 
paper. For green (light grown) and white (dark grown) 609 
soybean callus, activity was seen in the green soybean at the 
same mobility as that observed for purified APH(3>H, 

65 while no detectable activity was seen in white 609 tissue or 
in soybean transformed with an Agrobacterium lacking the 
APH(3>n gene. 
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Demonstration of kanamycin resistance in the trans- 
formed tissue 

Greening of the soybean callus occurred spontaneously 
after exposure to light. Some of the green 609 callus selected 
for its friability was disaggregated as described previously 5 
and used to analyze its resistance to kanamycin. It was 
compared to similar non-pigmented tissue grown in com- 
plete darkness. Dark grown 609 as well as control 327 tissue 
died in the presence of 50 mg/L kanamycin, while the light 
grown tissue could survive up to 300 mg/L kanamycin 
although its growth was slightly inhibited at this concentra- 
tion. 
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Example 2 

Construction of a Napin Promoter 

There are 298 nucleotides upstream of the ATG start 
codon of the napin gene on the pgNl clone, a 33 kb EcoRI 
fragment of B. napus genomic DNA containing a napin gene 
cloned into pUC8 (available from Marti Crouch, University ^ 
of Indiana). pgNl DNA was digested with EcoRI and SstI 
and ligated to EcoRI/SstI digested pCGN706. (pCGN706 is 
an Xhol/Pstl fragment containing 3' and polyadenylation 
sequences of another napin cDNA clone pN2 (Crouch et al., 
1983 supra) cloned in pCGN566 at the Sail and PstI sites.) 
The resulting clone pCGN707 was digested with Sail and 
treated with the enzyme Bal31 to remove some of the coding 
region of the napin gene. The resulting resected DNA was 
digested with Smal after the Bal31 treatment and religated. 
One of the clones, pCGN713, selected by size, was sub- ^ 
cloned by EcoRI and BamHI digestion into both EcoRI- 
BamHI digested pEMBL18 (Dente et aL, Nucleic Acids Res, 
(1983) 11:1645-1655) andpUC118 to give E418 andE4118 
respectively. The extent of Bal31 digestion was confirmed by 
Sanger dideoxy sequencing of E418 template. The Bal31 35 
deletion of the promoter region extended only to 57 nucle- 
otides downstream of the start codon, thus containing the 5* 
end of the napin coding sequence and about 300 bp of the 5* 
non-coding region. E4118 was tailored to delete all of the 
coding region of napin including the ATG start codon by in 
vitro mutagenesis by the method of Zollex and Smith 
(Nucleic Acids Res, (1982) 10:6487-6500) using an oligo- 
nucleotide primer 
5-GATGTTTTGTATGTGGGCCCCTAGGAGArC-3'. 
Screening for the appropriate mutant was done by two 45 
transformations into E, coli strain JM83 (Messing J., In: 
Recombinant DNA Technical Bulletin, NTH Publication No. 
79-99, 2 No. 2, 1979, pp 43^8) and Smal digestion of 
putative transformants. The resulting napin promoter clone 
is pCGN778 and contains 298 nucleotides from the EcoRI 50 
site of pgNl to the A nucleotide just before the ATG start 
codon of napin. The promoter region was subcloned into a 
chloramphenicol resistant background by digestion with 
EcoRI and BamHI and ligation to EcoRI-BamHI digested 
pCGN565 to give pCGN779c. 

Extension of the Napin Promoter Clone 

pCGN779c contains only 298 nucleotides of potential 
5'-regulatory sequence. The napin promoter was extended 
with a 1.8 kb fragment found upstream of the 5-EcoRI site 
on the original XBnNa clone. The -3.5 kbXhoI fragment of 60 
ABnNa (available from M. Crouch), which includes the 
napin region, was subcloned into S all-digested pUC119 to 
give pCGN930. A HindHI site close to a 5' Xhol site was 
used to subclone the HindHI-EcoRI fragment of pCGN930 
into HindHI-EcoRI digested Bluescript + (Vector Cloning 65 
Systems, San Diego, Calif.) to give pCGN942. An extended 
napin promoter was made by ligating pCGN779c digested 
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with EcoRI and PstI and pCGN942 digested with EcoRI and 
PstI to make pCGN943. This promoter contains -2.1 kb of 
sequence upstream of the original ATG of the napin gene 
contained on ABnNa. A partial sequence of the promoter 
5 region is shown in FIG. 1. 
Napin Cassettes 

The extended napin promoter and a napin 3 '-regulatory 
region are combined to make a napin cassette for expressing 
genes seed-specifically. The napin 3 -region used is from the 

10 plasmid pCGN1924 containing the XhoI-EcoRI fragment 
from pgNl (Xhol site is located 18 nucleotides from the stop 
codon of the napin gene) subcloned into EcoRI-Sall digested 
pCGN565. Hindm-Pstl digested pCGN943 andpCGN1924 
are ligated to make the napin cassette pCGN944, with 

15 unique cloning sites SmaL Sail, and PstI for inserting genes. 
Construction of cDNA Library from Spinach Leaves 
Total RNA was extracted from young spinach leaves in 
4M guanidine thiocyanate buffer as described by Facciotti et 

20 aL (Biotechnology (1985) 3:241-246). Total RNA was sub- 
jected to oligo(dT>cellulose column chromatography two 
times to yield poly(A) + RNA as described by Maniatis et al., 
(1982) Molecular Cloning: A Laboratory Manual, Cold 
Spring Harbor Laboratory, New York. A cDNA library was 

25 constructed in pUC13-Cm according to the method of 
Gubler and Hofl&nan, (Gene (1983) 25:263-269) with slight 
modifications. RNasin was omitted in the synthesis of first 
strand cDNA as it interfered with second strand synthesis if 
not completely removed, and dCTP was used to tail the 

3Q vector DNA and dGTP to tail double-stranded cDNA instead 
of the reverse as described in the paper. The annealed cDNA 
was transformed to competent E. coli JM83 (Messing (1979) 
supra) cells according to Hanahan (J. Mol. Biol. (1983) 
166:557-580) and spread onto LB agar plates (Miller (1972) 

35 Experiments in Molecular Genetics. Cold Spring Harbor 
Laboratory, Cold Spring Harbor, New York) containing 50 
ug/ml chloramphenicol and 0.005% X-GaL 
Identification of Spinach ACP-I cDNA 
A total of approximately 8000 cDNA clones were 

40 screened by performing Southern blots (Southern, /. MoL 
Biol (1975) 98:503) and dot blot (described below) hybrid- 
izations with clone analysis DNA from 40 pools represent- 
ing 200 cDNA clones each (see below). A 5' end-labeled 
synthetic oligonucleotide (ACPP4) that is at least 66% 

45 homologous with a 16 amino acid region of spinach ACP-I 
(S'-GArGTCTTGAGCCTTGTCCrCArCCACArrGArA 
CCAAACTCCTCCTC-3*) is the complement to a DNA 
sequence that could encode the 16 amino acid peptide 
glu-glu-glu-phe-gly-ile-asn-val-asp-glu-asp-lys-ala-gln-asp- 

so ile, residues 49-64 of spinach ACP-I (Kuo and Ohlrogge, 
Arch, BiochettL Biophys. (1984) 234:290-296) and was used 
for an ACP probe. 

Clone analysis DNA for Southern and dot blot hybridiza- 
tions was prepared as follows. Transformants were trans- 

55 ferred from agar plates to LB containing 50 ug/ml chloram- 
phenicol in groups of ten clones per 10 ml media. Cultures 
were incubated overnight in a 37° C. shaking incubator and 
then diluted with an equal volume of media and allowed to 
grow for 5 more hours. Pools of 200 cDNA clones each were 

60 obtained by mixing contents of 20 samples. DNA was 
extracted from these cells as described by Birnboim and 
Doly (Nucleic Acids Res. (1979) 7:1513-1523). DNA was 
purified to enable digestion with restriction enzymes by 
extractions with phenol and chloroform followed by ethanol 

65 precipitation. DNA was re suspended in sterile, distilled 
water and 1 \ig of each of the 40 pooled DNA samples was 
digested with EcoRI and HindHI and electrophoresed 
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through 0.7% agarose gels. DNA was transferred to nitro- 
cellulose filters following the blot hybridization technique of 
Southern. 

ACPP4 was 5' end-labeled using f 2 p DATP and T4 
kinase according to the manufacturer's specifications. Nitro- 5 
cellulose filters from Southern blot transfer of clone analysis 
DNA were hybridized (24 hours, 42° C.) and washed 
according to Berent et al. {BioTechniques (1985) 
3:208-220). Dot blots of the same set of DNA pools were 
prepared by applying 1 ug of each DNA pool to nylon io 
membrane filters in 0.5M NaOH. These blots were hybrid- 
ized with the probe for 24 hours at 42° C. in 50% 
formamide/1% SDS/1M NaCL, and washed at room tem- 
perature in 2x SSC/0.1% SDS (lx SSC=0.15M NaCl; 
0.015M Na citrate; SDS-sodium dodecylsulfate). DNA from 15 
the pool which was hybridized by the ACPP4 oligoprobe 
was transformed to JM83 cells and plated as above to yield 
individual transformants. Dot blots of these individual 
cDNA clones were prepared by applying DNA to nitrocel- 
lulose filters which were hybridized with the ACPP4 oligo- 20 
nucleotide probe and analyzed using the same conditions as 
for the Southern blots of pooled DNA samples. 

Nucleotide Sequence Analysis 

The positive clone, pCGNlSOL, was analyzed by diges- 
tion with restriction enzymes and the following partial map 25 
was obtained. 

pUC13-Cm 1-35-1 248 |-63-| 152 | -200 | 

30 



I I I I I I I 

H H N P Xh E SXBSmSsE** 

H-HindDl N-Ncol P-PvuH Xh-Xhol 
E-EcoRI S-SaU X-Xbal Sm-Smal 
B-BamHI Ss-SstI *- former PstI site destroyed 
with failing 

**poly linker with available restriction sites indicated 
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The cDNA clone was subcloned into pUC118 and 
pUC119 using standard laboratory techniques of restriction, 
ligation, transformation, and analysis (Maniatis et al., (1982) 
supra). Single-stranded DNA template was prepared and 
DNA sequence was determined using the Sanger dideoxy 
technique (Sanger et al., Proc. Nat. Acad. ScL US.A (1977) 
74:5463-5467). Sequence analysis was performed using a 
software package from InteliiGenetics, Inc. 

pCGNlSOL contains an (approximately) 700 bp cDNA 
insert including a stretch of A residues at the 3* terminus 
which represents the poly(A) tail of the mRNA. An ATG 
codon at position 61 is presumed to encode the MET 50 
translation initiation codon. This codon is the start of a 411 
nucleotide open reading frame, of which, nucleotides 
229-471 encode a protein whose amino acid sequence 
corresponds almost perfectly with the published amino acid 
sequence of ACP-I of Kuo and Ohlrogge supra as described 55 
previously. In addition to mature protein, the pCGNlSOL 
also encodes a 56 residue transit peptide sequence, as might 
be expected for a nuclear-encoded chloroplast protein. 

Napin-ACP Construct 

pCGN796 was constructed by ligating pCGNlSOL 60 
digested with Hindm-BamHL pUC8-CM digested with 
HindHI and BamHI and pUC118 digested with BamHL The 
ACP gene from pCGN796 was transferred into a chloram- 
phenicol background by digestion with BamHI and ligation 
with BamHI digested pCGN565. The resulting pCGN1902 65 
was digested with EcoRI and Smal and ligated to EcoRI- 
Smal digested pUC118 to give pCGN1920. The ACP gene 
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in pCGN1920 was digested at the Ncol site, filled in by 
treatment with the Klenow fragment, digested with Smal 
and religated to form pCGN1919. This eliminated the 
5'-coding sequences from the ACP gene and regenerated the 
5 ATG. This ACP gene was flanked with PstI sites by digesting 
pCGN1919 with EcoRI, filling in the site with the Klenow 
fragment and ligating a PstI linker. This clone is called 
pCGN945. 

The ACP gene of pCGN945 was moved as BamHI-PstI 

10 fragment to pUC 118 digested with BamHI and PstI to create 
pCGN945a so that a Smal site (provided by the pUC118) 
would be at the 5'-end of the ACP sequences to facilitate 
cloning into the napin cassette pCGN944. pCGN945a 
digested with Smal and PstI was ligated to pCGN944 
digested with Smal and PstI to produce the napin ACP 

15 cassette pCGN946. The napin ACP cassette was then trans- 
ferred into the binary vector pCGN783 by cloning from the 
Hindm site to produce pCGN948. 

Construction of the Binary Vector pCGN783 

2Q pCGN783 is a binary plasmid containing the left and right 
T-DNA borders of A. tumefaciens (Barker et aL, Plant MoL 
BioL (1983) 2:335-350); the gentamicin resistance gene of 
pPHlJI (Hirsch et aL, Plasmid (1984), 12:139-141) the 35S 
promoter of cauliflower mosaic virus (CaMV) (Gardner et 

25 aL, Nucleic Acids Res. (1981) 9:2871-2890), the kanamycin 
resistance gene of Tn5 (Jorgenson et aL, infra and Wolff et 
aL, Nucleic Acids Res. (1985) 13:355-367) and the 3' region 
from transcript 7 of pTiA6 (Barker et aL, (1983) supra). 
To obtain the gentamicin resistance marker, the gentami- 

30 cin resistance gene was isolated as a 3.1 kb EcoRI-PstI 
fragment of pPHTJl cloned into pUC9 yielding pCGN549. 
The Hindm-BamHI fragment containing the gentamicin 
resistance gene was substituted for the HindHI-Bgin frag- 
ment of pCGN587 creating pCGN594. 

35 pCGN587 was prepared as follows: The HinduT-Smal 
fragment of Tn5 containing the entire structural gene for 
APHH (Jorgenson et aL, MoL Gen. Genet (1979) 177:65) 
was cloned into pUC8 (Vieira and Messing, Gene (1982) 
19:259), converting the fragment into a HindJU-EcoRI 

40 fragment, since there is an EcoRI site immediately adjacent: 
to the Smal site. The Pstl-EcoRI fragment containing the 
3'-portion of the APHII gene was then combined with an 
EcoRI-BamHI-Sall-Pstl linker into the EcoRI site of pUC7 
(pCGN546W). Since this construct does not confer kana- 

45 mycin resistance, kanamycin resistance was obtained by 
inserting the BgLII-PstI fragment of the APHII gene into the 
BamHI-PstI site (pCGN546X). This procedure reassembles 
the APHH gene, so that EcoRI sites flank the gene. An ATG 
codon was upstream from and out of reading frame with the 

5 0 ATG initiation codon of APHH The undesired ATG was 
avoided by inserting a Sau3A-PstI fragment from the 5'-end 
of APHH, which fragment lacks the superfluous ATG, into 
the BamHI-PstI site of pCGN546W to provide plasmid 
pCGN550. 

55 The EcoRI fragment containing the APHH gene was then 
cloned into the unique EcoRI site of pCGN451, which 
contains an octopine synthase cassette for expression, to 
provide pCGN552 (1ATG). 

pCGN451 includes an octopine cassette which contains 

60 about 1556 bp of the 5* non-coding region fused via an 
EcoRI linker to the 3' non-coding region of the octopine 
synthase gene of pTiA6. The pTi coordinates are 11,207 to 
12,823 for the 3* region and 13,643 to 15,208 for the 5' 
region as defined by Barker et aL, Plant MoL BioL (1983) 

65 supra. 

The 5' fragment was obtained as follows. A small sub- 
cloned fragment containing the 5' end of the coding region. 
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as a BamHI-EcoRI fragment was cloned in pBR322 as 
plasmid pCGN407. The BamHI-EcoRI fragment has an 
XmnI site in the coding region, while pBR322 has two XmnI 
sites. pCGN407 was digested with XmnI, resected with 
Bal31 nuclease and EcoRI linkers added to the fragments. 5 
After EcoRI and BamHI digestion, the fragments were size 
fractionated the fractions cloned and sequenced. In one 
case, the entire coding region and 10 bp of the 5 1 non- 
translated sequences had been removed leaving the 5' non- 
translated region, the mRNA cap site and 16 bp of the 5' 10 
non-translated region (to a BamHI site) intact This small 
fragment was obtained by size fractionation on a 7% acry- 
lamide gel and fragments approximately 130 bp long eluted. 

This size fractionated DNA was ligated into M13mp9 and 
several clones sequenced and the sequence compared to the 15 
known sequence of the octopine synthase gene. The M13 
construct was designated pl4, which plasmid was digested 
with BamHI and EcoRI to provide the small fragment which 
was ligated to a Xhol to BamHI fragment containing 
upstream 5' sequences from pTiA6 (Garfinkel and Nester, J. 20 
Bacteriol. (1980) 144:732) and to an EcoRI to Xhol frag- 
ment containing the 3' sequences. 

The resulting Xhol fragment was cloned into the Xhol site 
of a pUC8 derivative, designated pCGN426. This plasmid 
differs from pUC8 by having the sole EcoRI site filled in 25 
with DNA polymerase I, and having lost the PstI and Hindm 
site by nuclease contamination of Hindi restriction 
endonuclease, when a Xhol linker was inserted into the 
unique HincH site of pUC8. The resulting plasmid 
pCGN45 1 has a single EcoRI site for the insertion of protein 30 
coding sequences between the 5' non-coding region (which 
contains 1,550 bp of 5* non-transcribed sequence including 
the right border of the T-DNA, the MRNA cap site and 16 
bp of 5' non-translated sequence) and the 3' region (which 
contains 267 bp of the coding region, the stop codon, 196 bp 35 
of 3* non-translated DNA, the polyA site and 1,153 bp of 3' 
non-transcribed sequence). pCGN451 also provides the right 
T-DNA border. 

The resulting plasmid pCGN45 1 having the ocs 5' and the m 
ocs 3* in the proper orientation was digested with EcoRI and 
the EcoRI fragment from pCGN551 containing the intact 
kanamycin resistance gene inserted into the EcoRI site to 
provide pCGN552 having the kanamycin resistance gene in 
the proper orientation. ^ 

This ocs/KAN gene was used to provide a selectable 
marker for the trans type binary vector pCGN587. 

The 5' portion of the engineered octopine synthase pro- 
moter cassette consists of pTiA6 DNA from the Xhol at bp 
15208- 13644 (Barker's numbering), which also contains the 50 
T-DNA boundary sequence (border) implicated in T-DNA 
transfer. In the plasmid pCGN587, the ocs/KAN gene from 
pCGN552 provides a selectable marker as well as the right 
border. The left boundary region was first cloned in 
M13mp9 as a HindHI-Smal piece (pCGN502) (base pairs 55 
602-2213) and recloned as a KpnI-EcoRI fragment in 
pCGN565 to provide pCGN580. pCGN565 is a cloning 
vector based on pUC8-Cm, but containing pUC18 linkers. 
pCGN580 was linearized with BamHI and used to replace 
the smaller Bglll fragment of pVCK102 (Knauf and Nester, 60 
Plasmid (1982) 8:45), creating pCGN585. By replacing the 
smaller Sail fragment of pCGN585 with the Xhol fragment 
from pCGN552 containing the ocs/KAN gene, pCGN587 
was obtained. 

The pCGN594 HindlH-BamHI region, which contains an 65 
5'-ocs-kanamycin-ocs-3' (ocs is octopine synthase with 5' 
designating the promoter region and 3' the terminator region. 
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see U.S. application Ser. No. 775,923, filed Sep. 13, 1985) 
fragment was replaced with the Hindm-BamHI polylinker 
region from pUC18. 

pCGN566 contains the EcoRI-Hindm linker of pUC18 

5 inserted into the EcoRI-Hindm sites of pUC13-CnL The 
Hindm-BglH fragment of pNW31C-8,29-l (Thomashow et 
aL, Cell (1980) 19:729) containing ORF1 and -2 of pTiA6 
was subcloned into the Hindm-BamHI sites of pCGN566 
producing pCGN703. 

10 The Sau3A fragment of pCGN703 containing the 3' 
region of transcript 7 (corresponding to bases 2396-2920 of 
pTiA6 (Barker et aL, (1983) supra) was subcloned into the 
BamHI site of pUC18 producing pCGN709. The EcoRI- 
Smal polylinker region of pCGN709 was substituted with 

15 the EcoRI-Smal fragment of pCGN587, which contains the 
kanamycin resistance gene (APH3-II) producing pCGN726. 

The EcoRI-SaU fragment of pCGN726 plus the BglH- 
EcoRI fragment of pCGN734 were inserted into the BamHI- 

20 Sail site of pUC8-Cm producing pCGN738. pCGN726c is 
derived from pCGN738 by deleting the 900 bp EcoRI- 
EcoRI fragment. 

To construct pCGN167, the Alul fragment of CaMV (bp 
7144-7735) (Gardner et aL, NucL Acid Res. (1981) 

25 9:2871-2888) was obtained by digestion with Alul and 
cloned into the Hindi site of M13mp7 (Messing et aL, NucL 
Acids Res. (1981) 9:309-321) to create C614. An EcoRI 
digest of C614 produced the EcoRI fragment from C614 
containing the 35S promoter which was cloned into the 

30 EcoRI site-of pUC8 (Vieira and Messing, Gene (1982) 
19:259) to produce pCGN146. 

To trim the promoter region, the BglH site (bp 7670) was 
treated with BgUI and resected with Bal3 1 and subsequently 
a Bgin linker was attached to the Bal31 treated DNA to 

35 produce pCGN147. 

pCGN148a containing a promoter region, selectable 
marker (KAN with 2 ATG's) and 3' region, was prepared by 
digesting pCGN528 with Bgin and inserting the BamHI- 
BgUI promoter fragment from pCGN147. This fragment 

40 was cloned into the Bgin site of pCGN528 so that the Bgin 
site was proximal to the kanamycin gene of pCGN528. 

The shuttle vector used for this construct, pCGN528, was 
made as follows. pCGN525 was made by digesting a plas- 
mid containing Tn5 which harbors a kanamycin gene 

45 (Jorgenson et aL, MoL Gen. Genet (1979) 177:65) with 
Hindm-BamHI and inserting the Hindm-BamHI fragment 
containing the kanamycin gene into the Hindm-BamHI sites 
in the tetracycline gene of pACYC184 (Chang and Cohen, 
J. BacterioL (1978) 134:1141-1156). pCGN526 was made 
by inserting the BamHI fragment 19 of pTiA6 (TTiomashow 
et aL, Ceil ( 1980) 19:729-739), modified with Xhol linkers 
inserted into the Smal site, into the BamHI site of pCGN525. 
pCGN528 was obtained by deleting the small Xhol fragment 

55 from pCGN526 by digesting with Xhol and religating. 

pCGN149a was made by cloning the BamHI-kanamycin 
gene fragment from pMB9KanXXI into the BamHI site of 
pCGN148a. 

pMB9KanXXI is a pUC4K variant (Vieira and Messing, 
60 Gene (1982) 19:259-268) which has the Xhol site missing 
but contains a functional kanamycin gene from Tn903 to 
allow for efficient selection in Agrobacterium. 

pCGN149a was digested with Bgin and SphL This small 
BgUI-SphI fragment of pCGN149a was replaced with the 
65 BamHI-SphI fragment from MI (see below) isolated by 
digestion with BamHI and SphL This produces pC(3N167, 
a construct containing a full length CaMV promoter, 1ATG- 
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kanamycin gene, 3' end and the bacterial Tn903-type kana- 
mycin gene. MI is an EcoRI fragment from pCGN546X (see 
construction of pCGN587) and was cloned into the EcoRI 
cloning site of M13mp9 in such a way that the PstI site in 
the IXTG-kanamycin gene was proximal to the polylinker 5 
region of M 13mp9. 

The Hindm-BamHI fragment in the pCGN167 containing 
the CaMV-35S promoter, 1 ATG-kanamy ci n gene and the 
BamHI-fragment 19 of pTiA6 was cloned into the BamHI- 
Hindm sites of pUC19 creating pCGN976. The 35S pro- i° 
moter and 3' region from transcript 7 was developed by 
inserting a 0.7 kb HindHI-EcoRI fragment of pCGN976 
(35S promoter) and the 0.5 kb EcoRI-Sall fragment of 
pCGN709 (transcript 7:3') into the Hindm-Sall sites of 
pCGN566 creating pCGN766c. is 

The 0.7 Id) HindHI-EcoRI fragment of pCGN766c 
(CaMV-35S promoter) was ligated to the 1.5 kb EcoRI-Sall 
fragment in pCGN726c (1ATG-KAN 3* region) followed by 
insertion into the Hindm-Sall sites of pUC119 to produce 
pCGN778. The 2.2 kb region of pCGN778, Hindm-Sall 20 
fragment containing the CaMV-35S promoter and 1XTG- 
KAN-3' region was used to replace the HindHI-Sall linker 
region of pCGN739 to produce pCGN783. 

Transfer of the Binary Vector DCGN948 into Agrobac- ^ 
terium 

pCGN948 was introduced into Agrobacterium tutnefa- 
ciens EHA101 (Hood et al., /. Bacterid. (1986) 
168:1291-1301) by transformation. An overnight 2 ml cul- 
ture of EHA101 was grown in MG/L broth at 30° C. 0.5 ml 30 
was inoculated into 100 ml of MG/L broth (Garfinkel and 
Nester, 7. Bacteriol. (1980) 144:732-743) and grown in a 
shaking incubator for 5 h at 30° C. The cells were pelleted 
by centrifugation at 7K, resuspended in 1 ml of MG/L broth 
and placed on ice. Approximately, 1 ug of pCGN948 DNA 35 
was placed in 100 ul of MG/L broth to which 200 ul of the 
EHA101 suspension was added; the tube containing the 
DNA-cell mix was immediately placed into a dry ice/ethanol 
bath for 5 minutes. The tube was quick thawed by 5 minutes 
in 37° C. water bath followed by 2 h of shaking at 30° C. ^ 
after adding 1 ml of fresh MG/L medium. The cells were 
pelleted and spread onto MG/L plates (1.5% agar) contain- 
ing 100 mg/1 gentamicin. Plasmid DNA was isolated from 
individual gentamicin-resistant colonies, transformed back 
into E. coft and characterized by restriction enzyme analysis 45 
to verify that the gentamicin-resistant ERA 101 contained 
intact copies of pCGN948. Single colonies are picked and 
purified by two more streakings on MG/L plates containing 
100 mg/1 gentamicin. 
Transformation and Regeneration of B. Napus 50 
Seeds of Brassica napus cv Westar were soaked in 95% 
ethanol for 4 minutes. They were sterilized in 1% solution of 
sodium hypochlorite with 50 ul of "Tween 20" surfactant per 
100 ml sterile solution. After soaking for 45 minutes, seeds 
were rinsed 4 times with sterile distilled water. They were 55 
planted in sterile plastic boxes 7 cm wide, 7 cm long, and 10 
cm high (Magenta) containing 50 ml of '/loth concentration 
of MS (Murashige minimal organics medium, Gibco) with 
added pyridoxine (50 ug/1), nicotinic acid (50 ug/1), glycine 
(200 ug/l) and solidified with 0.6% agar. The seeds germi- 60 
nated and were grown at 22° C. in a 16h-8h light-dark cycle 
with the light intensity approximately 65 uEm~ 2 s -1 . After 5 
days the seedlings were taken under sterile conditions and 
the hypocotyls excised and cut into pieces of about 4 mm in 
length. The hypocotyl segments were placed on a feeder 65 
plate or without the feeder layer on top of a filter paper on 
the solidified B5 0/1/1 or B5 O/UO medium. B5 0/1/0 
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medium contains B5 salts and vitamins (Gamborg, Miller 
and Ojima, Experimental Cell Res. (1968) 50:151-158), 3% 
sucrose, 2 ,4-dichlorophenoxy acetic acid (1.0 mg/l), pH 
adjusted to 5.8, and the medium is solidified with 0.6% 

5 Phytagar; B5 0/1/1 is the same with the addition of 1.0 mg/l 
kinetin. Feeder plates were prepared 24 hours in advance by 
pipetting 1.0 ml of a stationary phase tobacco suspension 
culture (maintained as described in Fillatti et al., Molecular 
General Genetics (mi) 206:192-199) onto B5 0/1/0 or B5 

10 0/1/1 medium. Hypocotyl segments were cut and placed on 
feeder plates 24 hours prior to Agrobacterium treatment 

Agrobacterium tumefaciens (strain EHA 101x948) was 
prepared by incubating a single colony of Agrobacterium in 
MG/L broth at 30° C. Bacteria were harvested 16 hours later 

15 and dilutions of 10 8 bacteria per ml were prepared in MG/L 
broth. Hypocotyl segments were inoculated with bacteria by 
placing the segments in an Agrobacterium suspension and 
allowing them to set for 30-60 minutes, then removing and 
transferring to Petri plates containing B5 0/1/1 or 0/1/0 

20 medium (0/1/1 intends 1 mg/l 2,4-D and 1 mg/l kinetin and 
0/1/0 intends no kinetin). The plates were incubated in low 
light at 22° C. The co-incubation of bacteria with the 
• hypocotyl segments took place for 24-48 hours. The hypo- 
cotyl segments were removed and placed on B5 0/1/1 or 

25 0/1/0 containing 500 mg/l carbenicillin (kanamycin sulfate 
at 10, 25, or 50 mg/l was sometimes added at this time) for 
7 days in continuous light (approximately 65 uEM" 2 S _1 ) at 
22° C. The segments were transferred to B5 salts medium 
containing 1% sucrose, 3 mg/l benzylamino purine (BAP) 

30 and 1 mg/l zeatin. This was supplemented with 500 mg/l 
carbenicillin, 10, 25, or 50 mg/l kanamycin sulfate, and 
solidified with 0.6% Phytagar (Gibco). Thereafter, explants 
were transferred to fresh medium every two weeks. 

After one month green shoots developed from green calli 

35 which were selected on media containing kanamycin. 
Shoots continued to develop for three months. The shoots 
were cut from the calli when they were at least 1 cm high and 
placed on B5 medium with 1% sucrose, no added growth 
substances, 300 mg/l carbenicillin, and solidified with 0.6% 

40 phytagar. The shoots continued to grow and several leaves 
were removed to test for neomycin phosphotransferase II 
(NPTH) activity. Shoots which were positive for NPTH 
activity were placed in Magenta boxes containing B5 0/1/1 
medium with 1% sucrose, 2 mg/l indolebutyric acid, 200 

45 mg/l carbenicillin, and solidified with 0.6% Phytagar. After 
a few weeks the shoots developed roots and were transferred 
to soil. The plant were grown in a growth chamber at 22° C. 
in a 16-8 hours light-dark cycle with light intensity 220 
uEM~ 2 s~* and after several weeks were transferred to the 

50 greenhouse. 

Southern Data 

Regenerated B. napus plants from cocultivations of Agra- 
bacterium tumefaciens EHA101 containing pCGN948 and 
B. napus hypocotyl s were examined for proper integration 

55 and embryo-specific expression of the spinach leaf ACP 
gene. Southern analysis was performed using DNA isolated 
from leaves of regenerated plants by the method of Krul et 
al., 12th Annual UCLA Symposium on Plant Molecular 
Biology, Apr. 16-23, 1983, J. Cell Biochem Supp. O (7 part 

60 B), p. 265 and purified once by banding in CsCl. DNA (10 
ug) was digested with the restriction enzyme EcoRL elec- 
trophoresed on a 0.7% agarose gel and blotted to nitrocel- 
lulose (see Maniatis et al, (1982) supra.). Blots were probed 
with pCGN945 DNA containing 1.8 kb of the spinach ACP 

65 sequence or with the EcoRI-HindHI fragment isolated from 
pCGN936c (made by transferring the Hindlll-EcoRI frag- 
ment of pCGN930 into pCGN566) containing the napin 5' 
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sequences labeled with 32 P-dCTP by nick translation 
(described by the manufacturer. BRL Nick Translation 
Reagent Kit. Bethesda Research Laboratories, Gaithersburg, 
Md.). Blots were prehybridized and hybridized in 50% 
formamide, 10x Denhardt's, 5xSSC, 0.1% SDS, 5 mM 5 
EDTA, 100 ug/ml calf thymus DNA and 10% dextran sulfate 
(hybridization only) at 42° C. (Reagents described in Mania- 
tis et al., (1982) supra.) Washes were in lxSSC, 0.1% SDS, 
30 min and twice in O.lxSSC, 0.1% SDS 15 min each at 55° 
C. io 

Autoradiograms showed two bands of approximately 3.3 
and 3.2 kb hybridized in the EcoRI digests of DNA from j 
four plants when probed with the ACP gene (pCGN945) 
indicating proper integration of the spinach leaf ACP con- 
struct in the plant genome since 3.3 and 3.2 kb EcoRI 15 
fragments are present in the T-DNA region of pCGN948. 
The gene construct was present in single or multiple loci in 
the different plants as judged by the number of plant 
DNA-con struct DNA border fragments detected when 
probed with the napin 5* sequences. 20 

Northern Data 

Expression of the integrated spinach leaf ACP gene from 
the napin promoter was detected by Northern analysis in 
seeds but not leaves of one of the transformed plants shown ^ 
to contain the construct DNA. Developing seeds were col- 
lected from the transformed plant 21 days postanthesis. 
Embryos were dissected from the seeds and frozen in liquid 
nitrogen. Total RNA was isolated from the seed embryos and 
from leaves of the transformed plant by the method of ^ 
Crouch et al., (1983) supra, electrophoresed on 
fornialdehyde-containing 1.5% agarose gels as described 
(Shewmaker et al., Virology (1985) 140:281-288) and blot- 
ted to nitrocellulose (Thomas, Proc. Natl Acad. ScL U.S.A. 
(1980) 77:5201-5205). Blots were prehybridized, 35 
hybridized, and washed as described above. The probe was 
an isolated Pstl-BamHI fragment from pCGN945 containing 
only spinach leaf ACP sequences labeled by nick translation. 

An RNA band of -0.8 kb was detected in embryos but not 
leaves of the transformed plant indicating seed-specific 40 
expression of the spinach leaf ACP gene. 

Example 3 

Construction of B. Campestris Napin Promoter 

Cassette 45 

A Bgffl partial genomic library of B. campestris DNA was 
made in the lambda vector Charon 35 using established 
protocols (Maniatis et al., (1982) supra). The titer of the 
amplified library was ~1.2xl0 9 phage/ml. Four hundred 50 
thousand recombinant bacteriophage were plated at a den- 
sity of 10 5 per 9x9 in. NZY plate (NZYM as described in 
Maniatis et al, (1982) supra) in NZY+10 mM MgSo 4 +0.9% 
agarose after adsorption to DH1 E. coli cells (Hanahan, MoL 
Biol (1983) 166:557) for 20 min at 37° C. Plates were 55 
incubated at 37° C. for -13 hours, cooled at 4° C. for 2.5 
hours and the phage were lifted onto Gene Screen Plus (New 
England Nuclear) by laying precut filters over the plates for 
approximately 1 min and peeling them off. The adsorbed 
phage DNA was immobilized by floating the filter on 1.5M 60 
NaCl, 0.5M NaOH for 1 min., neutralizing in 1.5M NaCl, 
0.5M Tris-HCl, pH 8.0 for 2 min and 2xSSC for 3 min. 
Filters were air dried until just damp, prehybridized and 
hybridized at 42° C. as described for Southern analysis. 
Filters were probed for napin-containing clones using an 65 
Xhol-Sall fragment of the CDNA clone BE5 which was 
isolated from the B. campestris seed cDNA library described 
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using the probe pNl (Crouch et al., (1983) supra). Three 
plaques were hybridized strongly on duplicate filters and 
were plaque purified as described (Maniatis et al., (1982) 
supra). 

5 One of the clones named lambda CGN1-2 was restriction 
mapped and the napin gene was localized to overlapping 2.7 
kb Xhol and 2.1 kb Sail restriction fragments. The two 
fragments were subcloned from lambda CGN1-2 DNA into 
pCGN789 (a pUC based vector the same as pUC119 with 

10 the normal polylinker replaced by the synthetic linker - 5* 
GG AATTC GTCG AC AGATCTCTGCAGCTC- 
GAGGGATCCAAGCTT y (which represents the 
polylinker EcoRJU Sail, BgUL PstI, Xhol, Bamffl, HindHI). 
The identity of the subclones as napin was confirmed by 

15 sequencing. The entire coding region sequence as well as 
extensive 5* upstream and 3' downstream sequences were 
determined (FIG. 2). The lambda CGN1-2 napin gene is that 
encoding the mRNA corresponding to the BE5 CDNA as 
determined by the exact match of their nucleotide sequence. 

20 An expression cassette was constructed from the 5' -end 
and the 3 -end of the lambda CGN1-2 napin gene as follows 
in an analogous manner to the construction of pCGN944. 
The majority of the napin coding region of pCGN940 was 
deleted by digestion with Sail and religation to form 

25 pCGN1800. Single-stranded DNA from pCGN1800 was 
used in an in vitro mutagenesis reaction (Adelman et al., 
DNA (1983) 2:183-193) using the synthetic oligonucleotide 
5' GCITGTTCGCCATGGATATCiTCTGTArGTTC 3'. 
This oligonucleotide inserted an EcoRV and an Ncol restric- 

30 tion site at the junction of the promoter region and the ATG 
start codon of the napin gene. An appropriate mutant was 
identified by hybridization to the oligonucleotide used for 
the mutagenesis and sequence analysis and named 

35 pCGN1801. 

A 1.7 kb promoter fragment was subcloned from 
pCGN1801 by partial digestion with EcoRV and ligation to 
pCGN786 (a pCGN566 chloramphenicol based vector with 
the synthetic linker described above in place of the normal 

40 polylinker) cut with EcoRI and blunted by filling in with 
DNA Polymerase I KLenow fragment to create pCGN1802. 
3* sequences from the lambda CGN1-2 napin gene were 
added to Xhol-Hindm digested pCGN1802 from pCGN941 
digested with Xhol and HindUL The resulting clone, 

45 pCGN1803, contains approximately 1.6 kb of napin 
3'-sequences as well as promoter sequences, but a 326 
nucleotide HindHI fragment normally found at the 3 -end of 
lambda CGN1-2 is inserted opposite to its natural orienta- 
tion. As a result, there are two HindHI sites in pCGN1803. 

SQ This reversed fragment was removed by digestion of 
pCGN1803 with HindHI. Following religation, a done was 
selected which now contained only approximately 1.25 kb 
of the original 1.6 napin 3'-sequence. This clone, 
pCGN1808, is the lambda CGN1-2 expression cassette and 

55 contains 1.725 kb of napin promoter sequence, and 1.265 kb 
of napin 3' sequences with the unique cloning sites Sail, 
Bgll, PstI, and Xhol in between. Any sequence that requires 
seed-specific transcription or expression in Brassica, for 
example, a fatty acid gene, can be inserted in this cassette in 

^ a manner analogous to that described for spinach leaf ACP 
and the B. napus napin cassette (see Example 2). 
pCGN3223 Napin Expression Cassette 
pCGN1808 is modified to contain flanking restriction 
sites to allow movement of only the expression sequences 

65 and not the antibiotic resistance marker to binary vectors 
such as pCGN1557 (McBride and Summerfelt (1990) PI 
MoL Biol 14:269-276). Synthetic oligonucleotides contain- 
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ing KpnL NotI and Hindm restriction sites are annealed and 
ligated at the unique Hindm site of pCGN1808, such that 
only one Hindm site is recovered. The resulting plasmid, 
pCGN3200 contains unique Hindm, NotI and Kpnl restric- 
tion sites at the 3'-end of the napin 3'-regulatory sequences 5 
as confirmed by sequence analysis. 

The majority of the napin expression cassette is subcloned 
from pCGN3200 by digestion with Hindm and Sad and 
ligation to Hindm and Sad digested pIC19R (Marsh, et al. 
(1984) Gene 32:48 1-485) to make pCGN3212. The extreme 10 
5'-sequences of the napin promoter region are reconstructed 
by PGR using pCGN3200 as a template and two primers 
flanking the SacI site and the junction of the napin 
5-promoter and the pUC backbone of pCGN3200 from the 
pCGN1808 construct. The forward primer contains Clal, 15 
Hindm, NotI, and Kpnl restriction sites as well as nucle- 
otides 408-423 of the napin 5'-sequence (from the EcoRV 
site) and the reverse primer contains the complement to 
napin sequences 718-739 which include the unique SacI site 
in the 5'-promoter. The PGR was performed using in a 
Perkin Elmer/Cetus thermocycler according to manufactur- 20 
er's specifications. The PGR fragment is subcloned as a 
blunt-ended fragment into pUC8 (Vieira and Messing (1982) 
Gene 19:259-268) digested with Hindi to give pCGN3217. 
Sequence of pCGN3217 across the napin insert verifies that 
no improper nucleotides were introduced by PCR. The napin 25 
5-sequences in pCGN3217 are ligated to the remainder of 
the napin expression cassette by digestion with Clal and 
SacI and ligation to pCGN32 12 digested with Clal and SacI 
The resulting expression cassette pCGN3221, is digested 
with Hindm and the napin expression sequences are gel 30 
purified away and ligated to pIC20H (Marsh, supra) digested 
with Hindm. The final expression cassette is pCGN3223, 
which contains in an ampicillin resistant background, essen- 
tially identical 1.725 napin 5' and 1.265 3' regulatory 
sequences as found in pCGN1808. The regulatory regions 35 
are flanked with Hindm, NotI and Kpnl restriction sites and 
unique Sail, BglHL PstI, and Xhol cloning sites are located 
between the 5' and 3' noncoding regions. 

Example 4 

40 

Isolation of Other Seed Specific Promoters 

Other seed-specific promoters may be isolated from genes 
encoding proteins involved in seed triacylglycerol synthesis, 
such as acyl carrier protein from Brassica seeds. Immature 
seeds were collected from Brassica campestris cv. "R-500 " 45 
a self-compatible variety of turnip rape. Whole seeds were 
collected at stages corresponding approximately to 14 to 28 
days after flowering. RNA isolation and preparation of a 
CDNA bank was as described above for the isolation of a 
spinach ACP CDNA clone except the vector used was 50 
pCGN 565. T o probe the cDNA bank, the oligonucleotide 
(5 *)- ACTTTCTCAACTGTCTCTGGTTTAGC AGC-(3 ') 
was synthesized using an Applied Biosystems DNA 
Synthesizer, model 3 80 A, according to manufacturer's rec- 
ommendations. This synthetic DNA molecule will hybridize 
at low stringencies to DNA or RNA sequences coding for the 
amino acid sequence (ala-ala-lys-pro-glu-thr-val-glulys- 
val). This amino acid sequence has been reported for ACP 
isolated from seeds of Brassica napus (Slabes et al., "Oil 
Seed Rape Acyl Carrier Protein (ACP) Protein and Gene 
Structure" 697-700, 1986, 7th International Symposium on 60 
The Metabolism, Structure, and Function of Plant Lipids 
(eds. P K. Stumpf, J. B. Mudd, W. D Ness) Prenum Press, 
New York and London); ACP from B. campestris seed is 
highly homologous. Approximately 2200 different cDNA 
clones were analyzed using a colony hybridization technique 65 
(Taub and Thompson, Anal Biochem. (1982) 126:222-230) 
and hybridization conditions corresponding to Wood et al. 
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(Proc. Natl Acad. Sci (1985) 82:1585-1588). DNA 
sequence analysis of two cDNA clones showing obvious 
hybridization to the oligonucleotide probe indicated that 
one, designated pCGNlBcs, indeed coded for an ACP- 

5 precursor protein by the considerable homology of the 
encoded amino acid sequence with ACP proteins described 
from Brassica napus (Slabas et al., 1980 supra). Similarly to 
Example 3, the ACP CDNA clone, pCGNlBCS, was used to 
isolate ACP genomic clones containing the regulatory infor- 
mation for expression of ACP during triacylglyceride syn- 

10 thesis in the seeds. DNA was isolated from B. campestris cv. 
R500 young leaves by the procedure of Scofield and Crouch 
(7. Biol. Chem. (1987) 262:12202-12208). A Sau3A partial 
genomic library of the B. campestris DNA was made in the 
lambda vector Embl 3 (Stratagene, San Diego, Calif.) using 
established protocols (Maniatis et al., (1982) supra) and 
manufacturer's instructions. The titer of the library was 
-1.0x10 s phage/mL Six hundred thousand recombinant bac- 
teriophage were plated and screened as described in 
Example 3 with the exception that the E. coli host cells used 
were strain P2392 (Stratagene, San Diego, Calif.). Filters 

20 were prehybridized and hybridized at 42° C in 25 ml each 
of hybridization buffer containing 50% foimamide, 10X 
Denhardt's, 5X SSC, 5 Mm EDTA, 0.1% SDS, and 100 
ug/ml denatured salmon sperm DNA (reagents described in 
Maniatis et aL, (1982) supra). The probe used in these 

25 hybridizations was 0.2 ug of a nick-translated 530 base pair 
Bgin-Dral fragment of pCGNlBcs, the B. campestris ACP 
CDNA clone described above. Six plaques were hybridized 
strongly on duplicate filters after washing the filters at 55° C. 
in O.lx SSC/0.2% SDS, and were plaque-purified as 

30 described (Maniatis et al., (1982) supra). 

Restriction analysis followed by Southern hybridization 
was performed on some of the clones using the hybridization 
conditions and radiolabeled probe described above. One 
clone, Bcg4-4, contains the ACP gene on two overlapping 

35 restriction fragments, an -5.1 kb SstI fragment and an -1.2 
kb Hindlll fragment These restriction fragments were sub- 
cloned into the cloning vector pCGN565. The DNA 
sequence of some regions of the subclones verified by 
homology that Bcg4-4 is an ACP gene. The sequence also 
shows that this particular ACP gene is expressed in plants, 

40 as the sequence in the coding region matches exactly the 
sequence of the PCGNLBcs ACP cDNA except for three 
regions. These regions are believed to be intervening 
sequences, a common element of eukaryotic genes that is 
spliced out during processing of mRNA (Padgett et al., Ann. 

45 Rev. Biochem. (1986) 55:1119-1150). Further restriction 
mapping of the SstI subclone identified an Xhol fragment 
containing -1.5 kb of 5* sequence upstream from the Xhol 
site near the 5' end of the PCGNLBCS cDNA clone. This 
Xhol fragment was subcloned in opposite orientations in the 

so cloning/sequencing vector Bluescript + (Stratagene, San 
Diego, Calif.) and the clones were designated pCGN1941 
and pCGN1941\ DNA sequencing of 1 kb of the DNA 
upstream of the coding region was completed. Also, the 
complete sequence of the 1.2 kb HindJH subclone described 

55 above was determined. The DNA sequence derived from the 
clones described above is shown in FIG. 3. Additional 
sequences at the 3' end of the ACP gene were subcloned on 
an -1.6 kb Sstl-BgLH fragment into Bluescript + and Blue- 
script - (clones are designated pCGN1940 and pCGN1940'). 
The SstI site in these clones is the one found at the 3' end of 

60 the ACP coding region of PCGNlBcs. 

An expression cassette can be constructed from the 5* 
upstream sequences and 3' downstream sequences of Bcg4-4 
as follows. The PCGN1941 Xhol subclone is used for the 5* 
regulatory region. This clone contains the Xhol insert in the 

65 opposite orientation of the lacZ gene. The 3' regulatory 
region is altered to allow cloning as a Pstl-BgUI fragment 
into PCGN565 by oligonucleotide site-directed mutagen- 
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esis. Single-stranded DNA is made from pCGN1940 and 
altered by mutagenesis as described (Adelrnan et al„ supra) 
with the synthetic oligonucleotide 5' CTTAAGAAGTAAC- 
CCGGGCTGCAGTTTTAGTATTAAGAG 3*. This oligo- 
nucleotide provides Smal and PstI restriction sites just after 5 
the TAA stop codon of the pCGNlBcs cDNA. The Pstl-BgUI 
3' fragment is then cloned into the PstI and BamHI sites (the 
BamHI restriction site is destroyed in this process) of 
PCGN565. The resulting clone is digested with PstI and 
Smal, and the fragment inserted into the corresponding sites JQ 
in PCGN1941 (described above) in the same orientation as 
the 5' region. The resulting clone comprises the ACP expres- 
sion cassette with PstL EcoRI, and EcoRV sites available 
between the 5* and 3' regulatory regions for the cloning of 
genes to be expressed under the regulation of these ACP 
gene regions. 15 

Example 5 

Isolation of Seed-specific cDNA Clone, EA9 

Ninety-six clones from the 14-28 day postanthesis B. 20 
campestris seed cDNA library (described in the previous 
example) were screened by dot blot hybridization of mini- 
prep DNA on Gene Screen Plus nylon filters (NEN Research 
Products, Boston, Mass.). The probes used were radioac- 
trvely labeled first-strand synthesis cDNAs made from the 2 s , 
day 14-28 postanthesis seed mRNA or from B. campestris 
leaf MRNA. Clones which hybridized strongly to seed 
cDNA and little or not at all to leaf cDNA were catalogued. 
A number of clones were identified as representing the seed 
storage protein napin by cross-hybridization with an Xhol- 
Sall fragment of pNI (Crouch et aL, (1983) supra), a B. 
napus napin cDNA. One of these napin clones, BES, was 
used in Example 3 to identify a B. campestris genomic clone 
as a source of an embryo-specific promoter. 

Another abundant class of cDNA clones were those 
represented by a clone designated EA9. EA9 cross- 35 
hybridized to seven other cDNA clones of 600 cDNAs 
screened by dot blot hybridization and was highly expressed 
ir- seeds and not in leaves. Northern blot analysis of mRNA 
i ated from day 14 postanthesis whole seed, and day 21 
£ v 28 postanthesis embryos using a 700 bp EcoRI fragment 40 
t\ rA9 (see below) as a probe shows that EA9 is highly 
e pressed at day 14 and expressed at a much lower level at 
day 21 and day 28 postanthesis. Because the embryo is so 
small at day 14, it was suspected that the predominant 
expression of EA9 might be in a tissue other man the 45 
embryo. Total RNA was isolated (Crouch et aL, (1983) 
supra) from whole seed (14, 15, 17 and 19 days 
postanthesis), seed coats (day 14 and day 21 postanthesis) 
and embryos (day 21 postanthesis). Twenty-five ug of each 
sample were analyzed by Northern blot analysis as described 50 
in Example 2. The probe used was a 0.7 kb EcoRI DNA 
fragment isolated from the EA9 cDNA and labeled by 
nick-translation. The results of the Northern analysis 
showed the EA9 RNA was detected in whole seed at all 
times tested and in seed coats, but not in the embryo. A 
separate Northern analysis of whole seed RNA from days 13 55 
through day 3 1 postanthesis (in two day intervals) indicated 
that EA9 was highly expressed between days 13 to 21 but 
was barely detectable by day 27 postanthesis. 

In Situ Hybridization 

Seed-coat specific expression of EA9 was confirmed by in 60 
situ hybridization analysis. Day 14 and 21 postanthesis 
whole seeds of B. campestris were fixed in a 4% paraform- 
aldehyde phosphate buffered saline (PBS) solution. The 
tissue was then dehydrated through a graded tertiary-butyl 
alcohol (TBA) series, infiltrated with paraplast and cast into 65 
paraffin blocks for sectioning (Berlyn and Miksche, Botani- 
cal Microtechnique and Cytochemistry (1976), Iowa State 
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University Press). Five um longitudinal sections of the 
embedded seeds (one cell-layer thickness) were generated 
on a Reichert Histostat rotary microtome. The paraffin 
ribbons containing the seed sections were then affixed to 
5 gelatin-chrome alum subbed slides (Berry n Miksche, (1976) 
supra). 

Single-stranded radiolabeled RNA probes were made 
using the Riboprobe reaction system (Promega, Madison, 
Wis.). This system utilizes a vector which is derived from 

10 pUC12 and contains a bacteriophage SP6 promoter which 
lies immediately upstream from an M13 polylinker. First, 
the 700 bp EcoRI fragment was isolated from EA9 and 
subcloned into the polylinker region of the riboprobe vector 
in both orientations (sense and anti-sense). To generate a 
template for the transcription run-off transcription reactions, 
the recombinant plasmids were propagated, purified, and 
linearized with HindUL The templates were then incubated 
in a reaction mixture containing the SP6 RNA polymerase, 
triphosphates and 35 S-UTP (as described by the 
manufacturer). After adding RQ DNase (Promega), the 

20 labeled RNAs were run over Boehringer pre-packed Sepha- 
dex spin columns to remove unincorporated triphosphates. 

The slides containing the sectioned seeds were hybridized 
with the radiolabeled sense and anti-sense RNA transcripts 
of EA9 according to the methods of Singer et al. 

25 (Biotechniques (1986) 4:230-241) and Taylor and Mar- 
tineau {Plant. Physiol (1986) 82:613-618). The hybridized 
slides were then treated with nuclear track emulsion NTB-3, 
(Eastman Kodak Company, Kodak Materials for light 
Microscope Autoradiography, 1986) sealed in a light-tight 

30 box and exposed for 4 weeks at 5°-10° C. After bringing the 
slides to room temperature they were developed in D-19 
developer (Eastman Kodak Company), rinsed, fixed and 
dehydrated through a graded alcohol series. Cover slips 
were mounted with cytoseal (VWR Scientific). 

35 Hybridization of the radiolabeled anti-sense EA9 ribo- 
probe was seen only in the seed coat tissue of both day 14 
and 21 seeds. No hybridization of the radiolabeled sense 
EA9 riboprobe was seen in any seed tissues. 
DNA Sequence and Gene Copy Number 

40 The restriction map and sequence of the EA9 CDNA 
clone have been determined (FIG. 4). Identification of a 
polyadenylation signal (Proudfoot and Brownlee, Nature 
(1976) 263:211-214) and of polyA tails at the 3'-end of EA9 
indicated the orientation of the cDNA clone and the direc- 

45 tion of transcription of the mRNA. The function of the 
encoded protein is unknown at this time. 

EA9 is a member of a small gene family as shown by 
Southern blot analysis. DNA was isolated from B. campes- 
tris leaves (as described in Example I, Southern analysis), 

50 digested with either BamHL, Bgin or HindlH and probed 
with a labeled fragment of EA9. Three fragments of 
genomic DNA hybridized in both BamHI and BglH digests. 
Only 2 bands hybridized in the Hindm digest. The data 
suggests that the EA9 family comprises between one and 

55 three genes. 

The sequence of EA9 is used to synthesize a probe which 
identifies a unique class of Brassica seed-specific genes from 
a genomic library in the manner described in Examples II 
and HI. The regulatory sequences of these genes are used to 
construct an expression cassette similar to those described 

60 for the napin genes, with the EA9 construct directing seed 
coat specific expression of any gene inserted in it. 

Example 6 

65 Other Seed Specific Examples 

Other seed-specific genes also can serve as useful sources 
of promoters. cDNA clones of cruciferin, the other major 
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seed storage protein of B, napus, have been identified 
(Simon et. aL, (1985) supra) and could be used to screen a 
genomic library for promoters. Without knowing the specific 
functions, yet other CDNA clones can be classified as to 
their level of expression in seed tissues, their timing of 5 
expression (i.e., when postanthesis they are expressed) and 
their approximate representation (copy number) in the B. 
campestris genome. Clones fitting the criteria necessary for 
expressing genes related to fatty acid synthesis or other seed 
functions can be used to screen a genomic library for 1Q 
genomic clones which contain the 5' and 3'regulatory 
regions necessary for expression. The non-coding regulatory 
regions can be manipulated to make a tissue- specific expres- 
sion cassette in the general manner described for other genes 
in previous examples. 

Example 7 

Construction of Tomato Ripe Fruit cDNA Bank and 
Screening for Fruit-Specific Clones 

Tomato plants (Lycopersicon esculentum cv UC82B) 20 
were grown under greenhouse conditions. Poly(A) + RNA 
was isolated as described by Mansson et aL, MoL Gen. 
Genet. (1985) 200:356-361. The synthesis of cDNA from 
poly(A) +RNA prepared from ripe fruit, cloning into the PstI 
site of the plasmid pUC9 and transformation into an E. coli 25 : 
vector were all as described in Mansson et aL, MoL Gen. 
Genet. (1985) 200:356-361. | 

Library Screening 

Two thousand recombinant clones were screened by 
colony hybridization with radiolabeled CDNA made from 30 j 
tomato red fruit MRNA, immature green fruit mRNA, and 
leaf mRNA. Bacterial colonies immobilized onto Gene- j 
Screen Pius filters (New England Nuclear), were denatured 
in 1.5M NaQ in 0.5M NaOH, then neutralized in 1.5M Nad 
in 0.5M Tris-HCl pH 8, and allowed to air dry. 35 
Hybridization, washing and autoradiography were all per- 
formed as described in Maniatis et aL, Molecular Cloning: 
A Laboratory Manual (1982) Cold Spring Harbor, New 
York. 

Sixty-five clones were selected which had more intense 40 
hybridization signals with fruit cDNA than with leaf cDNA 
and therefore appeared to be under-represented in the leaf 
MRNA population relative to the fruit population. Replicate 
slot blot filters were prepared using purified DNA from the 
selected clones and hybridized with radioactive cDNA from 45 
leaf, green fruit, and red fruit as before. This allowed 
selection of cDNA clone 2A11, also referred to as 
pCGN1299 which is on at high levels in both the fruit stages 
(red and green) and off in the leaf. 

Example 8 

Analysis of Clones 

Synthesis of RNA Probes 

The cDNA insert of pCGN1299 was excised as an EcoRI 
to HindHI fragment of approximately 600 bp (as measured 
on an agarose gel), and subcloned into the Riboprobe vector 
pGEMl (Promega Biotec), creating pCGN488. 32 P-labeled 
transcripts made from each strand of the pCGN488 insert 
using either SP6 or T7 polymerase were used as probes in 
separate Northern blots containing mRNA from leaf, imma- 
ture green and mature red fruits. The RNA transcript from 
the SP6 promoter did not hybridize to the tomato mRNA. 
However, the transcript from the T7 promoter hybridized to 
an MRNA of approximately 700 nt in length from the green 
fruit and the red fruit but not to mRNA from tomato leaf. The 
direction of transcription of the corresponding mRNA was 
thus determined. 



60 
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The tissue specificity of the pCGN1299 CDNA was 
demonstrated as follows. RNA from root, stem, leaf, and 
seven stages of fruit development (immature green, mature 
green, breaker, turning, pink, light red, and red) was sized on 

5 formaldehyde/agarose gels according to the method 
described by Maniatis et al., (1982), immobilized on nitro- 
cellulose and hybridized to "P-labeled RNA which was 
synthesized in vitro from pCGN488 using T7 polymerase. 
Each lane contained 100 ng of polyA + RNA except for two 
lanes (pink and light red lanes) which contained 10 ug of 

10 total RNA. The Northern analysis of mRNA from root, stem, 
leaf, and various stages of fruit development indicated that 
pCGN1299 cDNA was expressed in all stages of fruit 
development from the early stages immediately after anthe- 
sis to red ripe fruit. No MRNA hybridizing to pCGN1299 

15 was found in leaf, stem, or root tissue. The size of the mRNA 
species hybridizing to the pCGN488 probe was approxi- 
mately 700 nt 

Message abundance corresponding to the pCGN1299 
cDNA was determined by comparing the hybridization 

20 intensity of a known amount of RNA synthesized in vitro 
from pCGN488 using SP6 polymerase to mRNA from red 
tomato fruit in a Northern blot. The 32 P-labeled transcript 
from pCGN488 synthesized in vitro using T7 polymerase 
was used as a probe. The Northern analysis was compared 

25 to standards which indicated that the pCGN1299 cDNA 
represents an abundant MRNA class in tomato fruit, being 
approximately 1% of the message. 

Example 9 

30 Sequencing of pCGN1299 and pCGN1298 CDNA 

Clones 

DNA Sequencing 

The polyA + sequence was missing from pCGN1299 

35 CDNA. A longer CDNA clone, pCGN1298, therefore was 
identified by its hybridization with the pCGN488 probe. The 
complete DNA sequence of the two cDNA inserts was 
determined using both Maxam-Gilbert and the Sanger 
dideoxy techniques and is as follows. The sequence of 
pCGN1298 contains additional sequences at both the 5* and 
3* end compared to pCGN1299. As shown in FIG. 8, the 
sequences are identical over the region that the two clones 
have in common. 

Amino Acid Sequence 

45 The pCGN1299 CDNA sequence was translated in three 
frames. The longest open reading frame (which starts from 
the first ATG) is indicated. Bom pCGN1299 andpCGN1298 
have an open reading frame which encodes a 96 amino acid 
polypeptide (see FIG. 8). The protein has a hydrophobic 
N-terminus which may indicate a leader peptide for protein 
targeting. A hydrophobicity profile was calculated using the 
Hopp and Woods, (Prvc. NatL Acad. Sci. USA. (1981) 
78:3824-3828) algorithm. Residues 10-23 have an 
extremely hydrophobic region. A comparison of 2A11 to pea 
storage proteins and other abundant storage proteins is 

55 shown in FIG. 6. The sulfur-rich composition of the fruit- 
specific protein is similar to a pea storage protein which has 
recently been described (see Higgins et al., J. Biol. Chem. 
(1986) 261:11124-11130, for references to the individual 
peptides). This may indicate a storage role for this fruit- 

60 specific protein abundant species. 

Example 10 

Screening Genomic Library for Genomic Clones 

65 Southern Hybridization 

Southern analysis was performed as described by Mania- 
tis et al., 1982. Total tomato DNA from cuitivar UC82B was 
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digested with EcoRI or Hindm, separated by agarose gel 
electrophoresis and transferred to nitrocellulose. Southern 
hybridization was performed using a 32 P-labeled probe 
produced by nick translation of pCGN488 (Maniatis et al., 
1982). The simple hybridization pattern indicated that the 5 
gene encoding pCGN1299 cDNA was present in a few or 
perhaps even one copy in a tomato genome. 
Isolation of a Genomic Gone 

A genomic library established in Charon35/Sau3A con- 
structed from DNA of the tomato cultivar VFNT-Cherry was 10 
screened using the [ 32 P-]-RNAfrom cDNA clone pCGN488 
as a probe. A genomic clone containing approximately 12.5 
kb of sequence from the tomato genome was isolated. The 
region which hybridizes to a pCGN488 probe spans an Xbal 
restriction site which was found in the cDNA sequence and 15 
includes the transcriptional initiation region designated 
2A11. 

Sequence of Genomic Clone The DNA sequence of the 
genomic clone was determined by Sanger dideoxy tech- 
niques and is as shown in FIG. 7. The sequence of the 20 j 
genomic clone is identical to the pCGN1299 cDNA clone 
over the region they have in common. 

Subcloning 

The region surrounding the Xbal restriction site, approxi- 25 
mately 2.4 kb in the 5' direction and approximately 2.1 kb 
in the 3' direction was subcloned to provide an expression 
cassette. The 5' Xhol to Xbal fragment and the 3' Xbal to 
EcoRI fragment from the 2A11 genomic clone were inserted 
into a pUC-derived chloromphenicol plasmid containing a 30 
unique Xhol site and no Xbal site. This promoter cassette 
plasmid is called pCGN1273. 

Example 11 

Construction of Fruit-Specific Antisense Cassette 35 
Insertion of Antisense Fragment 

The 2A11 genomic fragment was tagged with PG anti- 
sense sequences by insertion of PG into the unique Xbal site 
of the pCGN1273 promoter cassette in the antisense orien- 40 
tation. The inserted sequences increased the size of the 
mRNA over the endogenous transcript, and thus the expres- 
sion pattern of the construct could be compared to the 
endogenous gene by a single Northern hybridization in a 
manner analogous to the detection of a tuber-specific potato 45 
gene described by Eckes et al., Mol Gen. Genet. 1986 
205:14-22. 

Example 12 

Insertion of Tagged Genomic Construction Into 50 
Agrobacterium Binary Vectors 

The tagged genomic construction is excised using the 
flanking Xhol restriction enzyme sites and is cloned into the 
unique Sail site of the binary plasmid pCGN783 (see 55 
Example 2 for construction) containing a plant kanamycin 
resistance marker between the left and right borders to 
provide plasmid pCGN1269. 

This plasmid binary vector in E. coli C2110 is conjugated 
into A. tumefaciens containing a disarmed Ti-plasmid eo ! 
capable of transferring the polygalacturonase antisense cas- 
sette and the kanamycin resistance cassette into the plant 
host genome. 

The Agrobacterium system which is employed is A. 
tumefaciens PC2760 (G. Ooms et al., Plasmid (1982) 65 
7:15-29; Hoekema et al., Nature (1983) 303:179-181; Euro- 
pean Patent Application 84-200239.6, 2424183). 
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Example 13 

Transfer of Genomic Construction to Tomato via 
Cocultivation 

5 Substantially sterile tomato cotyledon tissue is obtained 
from seedlings which have been grown at 24° C, with a 16 
hr/8 hr day/night cycle in 100x25 mm petri dishes contain- 
ing Murashige-Skoog salt medium and 0.8% agar (pH 6.0). 
Any tomato species may be used, however, here the inbred 

10 breeding line was UC82B, available from the Department of 
Vegetable Crops, University of California, Davis, Calif. 
95616. The cotyledons are cut into three sections and the 
middle placed onto feeder plates for a 24-hour preincuba- 
tion. The feeder plates are prepared by pipetting 0.5 ml of a 

15 tobacco suspension culture (10 6 cells/ml) onto 0.8% agar 
medium, containing Murashige minimal organic medium 
(K.C. Biologicals), 2,4-D (0.1 mg/1), kinetin (1 mg/1), 
thiamine (0.9 mgVl) and potassium acid phosphate (200 
mg/1, pH 5.5). The feeder plates are prepared two days prior 

20 to use. A sterile 3 mm filter paper disk containing feeder 
medium is placed on top of the tobacco cells after the 
suspension cells are grown for two days. 
Following the preincubation period, the middle one third 

^ of the cotyledon sections are placed into a liquid MG/L broth 
culture (1-5 ml) of the A. tumefaciens strain. The binary 
plasmid pCGN1269 is transferred to A. tumefaciens strain 
2760 by conjugation ox by transformation selecting for 
Gentamicin resistance encoded by the plasmid pCGN1269. 

^ The cotyledon sections are cocultivated with the bacteria for 
48 hrs. on the feeder plates and then transferred to regen- 
eration medium containing 500 mg/1 carbenicillin and 100 
mg/1 kanamycin. The regeneration medium is a K. C. 
Biologicals Murashige-Skoog salts medium with zeatin (2 

35 mg/1) myo-inositol (100 mg/1), sucrose (20 g/1), Nitsch 
vitamins and containing Q/8% agar (pH 6.0). In 2-3 weeks, 
shoots are observed to develop. When the shoots are 
approximately 1.25 cm, they are excised and transferred to 
a Murashige and Skoog medium containing carbenicillin 

^ (500 mg/1) and kanamycin (50 mg/1) for rooting. Roots 
develop within 10-12 days. 

Shoots which develop and subsequently root on media 
containing the kanamycin are tested for APH3TL enzyme. 
An aminoglycoside phosphotransferase enzyme 

45 (APH3TT) assay is conducted on putative transformed 
tomato plants and shoots. APH3TI confers resistance to 
kanamycin and neomycin. APH3TTs activity is assayed 
(Reiss et aL, Gene (1984) 30:211-218) employing electro- 
phoretic separation of the enzyme from other mterfering 

50 proteins and detection of its enzymatic activity by in situ 
phosphorylation of kanamycin. Both kanamycin and [Y~ 32 P] 
ATP act as substrates and are embedded in an agarose gel 
which is placed on top of the polyacrylamide gel containing 
the proteins. After the enzymatic reaction, the phosphory- 

55 lated kanamycin is transferred to P-81 phosphocellulose ion 
exchange paper and the radiolabeled kanamycin is finally 
visualized by autoradiography. The Reiss et aL, supra 
method is modified in the final washing of the P-81 ion 
exchange paper by rinsing in 0.1 mg/ml of proteinase K. 

60 

Example 14 

Construction of Tagged 2A11 Plasmids In Binary 
Vectors 

65 The compete sequence of the 2 All genomic DNA cloned 
into pCGN1273 from the Xhol site (position 1 at the 5* end) 
to the EcoRI site (position 4654) is shown in FIG. 7. 
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pCGN1267 was constructed by deleting from pCGN1273 
a portion of the plasmid polylinker from the EcoRV site to 
the BamHI site. Two DNA sequences were inserted into 
pCGN1273 at the unique Xbal site (position 2494). This site 
is in the 3' non-coding region of the 2A11 genomic clone 5 
before the poly A site. 

pCG1^1273 was tagged with 360 bp (from base number 1 
to 360) from the 5* region of the tomato polygalacturonase 
(PG) cDNA clone, Fl (Sheehy et aL MoL Gen. Genet. 
(1987) 208:30-36) at the unique Xbal restriction enzyme w . 
site. The tag was inserted in the antisense orientation result- 
ing in plasmid pCGN1271 and in the sense orientation 
yielding plasmid pCGN1270. Each plasmid was linearized 
at the unique Bgin restriction enzyme site and cloned into 
the binary vector pCGN783 at the unique BamHI restriction *5 
enzyme site. 

pCGN1273 was also tagged with a 0.5 kb fragment of 
DNA (base number 1626 to 2 1 15) from a PG genomic clone 
(see FIG. 8) which spans the 5' end of intron/exon junction. 
This fragment was cloned into the Xbal site resulting in 20 
plasmid pCGN1215. pCGN1215 was linearized at the 
unique Bgin site and cloned into pCGN783 at the BamHI 
site resulting in two plasmids, pCGN1219 and pCGN1220, 
which differ only in the orientation of pCGN1215 within 
pCGN783. 25 

Three DNA sequences were inserted into pCGN1267 at 
the unique restriction enzyme sites (position 2402, 2406). 
These sites are in the 3' non-coding region of the 2A11 
genomic clone, 21 bp from the stop codon. The 383 bp Xbal 3Q 
fragment from the PG cDNA clone was cloned into the Qal 
site of pCGN1267 after filling in the Xbal and Clal ends 
with Klenow and blunt ligation. The fragment in a sense 
orientation resulted in plasmid pCGN1263 and in the anti- 
sense orientation gave pCGN1262. pCGN1263 was linear- 
ized at the unique Bgin site and cloned into pCGN783 at the 
BamHI site yielding pCGN!260. pCGN1262 was also lin- 
earized at the Bgin site and cloned into pCGN783 at the 
BamHI site resulting in two plasmids, pCGN1255 and 
pCGN1258. which differ only in the orientation of 
PCGN1262 in the binary vector pCGN783. 

The 0.5 kb fragment of the PG genomic clone spanning 
the intron/exon junction (supra) was cloned into pCGN1267 
at the Qal site in an antisense direction yielding plasmid 
pCGN 1225 . This plasmid was linearized at the Bgin restric- 45 
tion enzyme site and cloned in to pCGN783 at the BamHI 
site producing two plasmids, pCGN1227 and pCGN1228, 
which differ only in the orientation of pCGN1225 in the 
binary vector. 

The EcoRI fragment (base numbers 5545 to 12,823) 50 
(Barker et al., Plant MoL Biol (1983) 2:335-350) from the 
octopine plasmid pTiA6 of A tumefaciens (Knauf and 
Nester, Plasmid (1982) 8:45-54) was subcloned into pUC 19 
at the EcoRI site resulting in plasmid pCGN71. A Rsal 
digest allowed a fragment of DNA from bases 8487 to 9036 55 
of the EcoRI fragment to be subcloned into the vector ml3 
BlueScript Minus (Stratagene, Inc.) at the Smal site result- 
ing in plasmid pCGN1278. This fragment contains the 
coding region of the genetic locus designated tmr which 
encodes a dimethylallyl transferase (isopentenyl transferase) 60 
(Akiyoshi et al., Proc. Natl Acad, Set. U.SA. (1984) 
81:5994-5998; Barry et al., ibid (1984) 81:4776-4780). An 
exonuclease/mung bean treatment (Promega Biotech) pro- 
duced a deletion on the 5' end of the tmr gene to a point 39 
base pairs 5' of the start codon. The tmr gene from 65 
pCGN1272 was subcloned into the Qal site of pCGN1267. 
The tmr gene in the sense orientation yielded pCGN1261 
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and in the antisense orientation gave plasraid pCGN1266. 
pCGN1261 was linearized at the Bgllt site and cloned into 
pCGN783 at the BamHI site resulting in plasmid 
pCGN1254. pCGN1266 was also linearized at the BglH site 
5 and subcloned into pCGN783 at the BamHI site yielding 
two plasmids, pCGN1264 and pCGN1265, which differ only 
in the orientation of pCGN1266 in pCGN783. 

Analysis of Expression in Transgenic Plants 

Immature green fruit (approximately 3.2 cm in length) 
10 was harvested from two tomato plants cv. UC82B that had 
been transformed with a disarmed Agrobacterium strain 
containing pCGN1264. Transgenic plants are designated 
1264-1 and 1264-11. The pericarp from two fruits of each 
plant was ground to a powder under liquid N 2 , total RNA 
1 5 extracted and poly A + mRNA isolated (as described in Mans- 
son et al., Mol. Gen. Genet (1985) 200:356-361). Young 
green leaves were also harvested from each plant and 
polyA* mRNA isolated. 

Approximately 19 ug of total RNA from fruit, 70 ng of 
20 polyA + mRNA from fruit and 70 ng of polyA + mRNA from 
leaves from transformed plants 1264-1 and 1264-11 was run 
on a 0.7% agarose formaldehyde Northern gel and blotted 
onto nitrocellulose (Maniatis et al., Molecular Cloning: A 
Laboratory Manual (1982) Cold Spring Harbor, New York). 
Also included on the gel as a negative control was approxi- 
mately 50 ng of polyA + mRNA from leaf and immature 
green fruit of a nontransformed UC82B plant. 

As a positive control and to help in quantitating mRNA 
^ levels, in vitro transcribed RNA from pCGN1272 was 
synthesized using T3 polymerase (Stratagene, Inc.). Nine- 
teen pg and 1.9 pg of this in vitro synthesized RNA were 
loaded on the Northern gel. 

The probe for the Northern filter was the 1.0 kb tmr insert 
35 DNA (a Kpnl to SACI fragment) from pCGN1272 isolated 
by electroelution from an agarose gel (Maniatis, supra 
(1982)) and labeled by nick translation (Bethesda Research 
Laboratory kit) using ot 32 p dCTP (Amersham). 

The Northern filter was prehybridized at 42° C. for 5 hrs. 
40 in the following solution: 25 ml formamide, 12.5 ml 20x 
SSC, 2.5 ml 1M sodium phosphate, 5 ml 50x Denhardts, 0.5 
ml 10% SDS, 1 ml 250 mM EDTA, 1 ml 10 mg/ml ssDNA 
and 2 ml H 2 0. Then one-fifth volume of 50% dextran sulfate 
and approximately 2.2xl0 7 cpm of the probe was added and 
45 hybridization was for 15 hrs. at 42° C. 

The Northern filter was washed in 2x SSC and 0.1% SDS 
at 550 for 20 minutes each wash. The filter was allowed to 
air dry before being placed with Kodak XAR film and an 
intensifying screen at "70° for two days. 
50 Northern Results on Transgenic Plants 

The nicked tmr probe hybridized with a mRNA species 
approximately 1.7 kb in length was observed in the total 
RNA and polyA + mRNA fruit lanes of the Northern blot 
This is the expected length of the reintroduced 2A11 gene 
55 (0.7 kb) tagged with the tmr gene (1.0 kb) in the antisense 
orientation. The level of expression from the reintroduced 
tagged gene is somewhat lower than the level of expression 
of the endogenous 2A11 gene. The level of expression of the 
reintroduced gene in immature green fruit is higher than the 
60 expression level in leaf tissue with a small amount of 
hybridizing mRNA in leaf tissue in these transformants. 

Example 15 

65 Different Sized 2A11 5' Regions 

The design of the 2A11 cassette is shown in FIG. 9. The 
cassette contains 3.8 kb of DNA 5* of the transcriptional start 
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site and the entire 3* region (from the TGA stop codon to a 
site 2.0 kb 3' of the poly A addition site) of the 2A11 gene. 
FIG. 7 shows the restriction sites and indicates (below the 
representation of the gene) the regions of the 2A11 gene 
used to construct the 2A11 cassette. The 2A11 cassette was 5 
constructed as follows. 

Transcriptional Initiation Region 

The 5* end of the 2A11 cassette was constructed starting 
with an EcoRI subclone genomic clone as described in 
application PCTUS88/01811 cloned into the EcoRI of Blue- 10 
script (+) (Stratagene) resulting in pCGN1288. This clone 
contains sequences from the EcoRI site at position 1651 in 
the intron of the 2A11 gene to the EcoRI site located 2.5 Kb 
upstream of the Xhol site at position 1 of the sequenced 
region (see FIG. 7). The XHOI fragment from position 1 of 15 
the sequenced region to the XHOI site in the Bluescript 
polylinker was deleted creating plasmid pCGN2004 which 
contain the 2A11 region from position 1 to position 1651. 
The coding region of 2 All was deleted by treating this 
plasmid with Exonucleasem/Sl using the commercially 20 
available Erase-a-Base Kit (Promega Biotec) and sequenc- 
ing deletion plasmids until one was found which had the 
coding region deleted to position 1366. The resulting 
plasmid, pCGN125 1, had the genomic region from the Xhol 
site (position 1) to position 1366. The EcoRI fragment of 25 ,' 
pCGN1288 was then transferred to a chloramphenicol resis- 
tant plasmid vector, pCGN2015, to make pCGN1231. 
pCGN2015 is a Cm resistant derivative of the Bluescript 
plasmid. A BstEEI/BamHI fragment of pCGN125 1 was then 
transferred into BstEH/Bamffl digested pCGN1231 to make 30 
pCGN1235 which contains the region from the EcoRI site 
(2.5 kb upstream of the sequenced region) to position 1366 
of the sequenced region flanked by the Bluescript polylinker 
in a Cm resistant vector. 

Transcriptional and Translational Termination Region 35 : 

The 3* end of the 2A11 cassette was constructed from 
pCGN1273 (described in application PCI7US8801811) by 
digesting the plasmid with Pvul and EcoRI, isolating the 
2249 bp insert (from position 2402 to 4653), ligating with a 
double-stranded oligonucleotide containing the sequence 40 '< 
shown in FIG. 7 from the Bamffl sticky end to a Pvul sticky 
end into a Bluescript vector which had been digested with 
Bamffl and EcoRI The resulting plasmid, pCGN1238, 
contains the 3* end of the 2A11 gene from the stop codon at 
position 2381 to the EcoRI site at position 4653. 45 

Final Construction 

Several versions of the 2A1 1 cassette in different vectors 
with different flanking restriction sites have been con- 
structed; maps of the plasmids are shown in FIG. 10. ^ 

A cassette containing the 5* and 3' regions of the 2A11 
gene was constructed by ligating the BamHI to EcoRI insert 
of pCGN1238 into pCGN1235 which had been digested with 
Bamffl and Xbal (the Xbal site having been filled in with 
Klenow polymerase to make a blunt-ended fragment). The 55 
resulting plasmid, pCGN1240, has the 5' end of the 2A11 i 
gene from the EcoRI site 2.5 kb upstream of the Xhol site 
(position 1) to position 1366 (which is located between the 
transcriptional initiation site of the 2A1 1 gene and the ATG), 
followed by a polylinker region with sites for Smal, Bamffl, ^ 
PstI and Sail which can be conveniently used to insert genes 
followed by the 3' region from position 2381 to 4653. The 
plasmid backbone of pCGN1240 is the Bluescript Cm 
plasmid described above. 

Construction of Plasmid pCGN1241 65 

A more convenient version has the EcoRI of pCGN1240 
excised and inserted into a Bluescript vector called 
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pCGN1239 which has an altered polylinker region such that 
the entire cassette can be excised as a Sacl-Kpnl fragment. 
The altered Bluescript vector, pCGN1239, was constructed 
by modifying the BlueScript polylinker from the Sad site to 

5 the Kpnl site including a synthetic polylinker with the 
following sequence: AGCTCGGTACCGAATTC- 
GAGCTCGGTAC to create a polylinker with the following 
sites: SacI-KpnI-EcoRI Sacl-Kpnl. The EcoRI insert of 
pCGN1240 was inserted into pCGN1239 to make 

10 pCGN1241 (see FIG. 9). 

Construction of pCGN2610 and pCGN2611 
A chloramphenicol resistant version of the 2A1 1 promoter 
cassette was constructed by inserting the synthetic 
polylinker described above (see construction of pCGN1241) 

15 into pCGN2015 to make pCGN1246, followed by insertion 
of the EcoRI fragment of pCGN1241 to make pCGN2610 
and pCGN2611 which differ only by the orientation of the 
inserted fragment in the plasmid vector (see FIG. 8). 

20 Example 16 

Comparison of Expression from Different Sized 
2A11 5* Regions 

A beta-glucuronidase (Gus) reporter gene was used to 
25 evaluate the level of expression and tissue specificity of 
various 2A11-Gus constructions. Gus is a useful reporter 
gene in plant systems because it produces a highly stable 
enzyme, there is little or no background (endogenous) 
enzyme activity in plant tissues, and the enzyme is easily 
30 assayed using fluorescent or spectrophotometric substrates. 
See, for example, Jefferson Plant MoL Biol Rep. (1988) 
5:387-405. Histochemical stains for Gus enzyme activity 
are also available which can be used to analyze the pattern 
of enzyme expression in transgenic plants. Jefferson (1988), 
35 supra. 

Constructions containing 1.3 kb (short), 1.8 kb 
(intermediate length), or 3.8 kb (long) 2A11 5* sequences 
fused to the Gus reporter gene were prepared. In addition, 

^ constructions were prepared which have altered 3' ends. Hie 
altered 3' ends are either a shorter 2A11 3* end from tr5 of 
the T-DNA of the Ti plasmid (Willmitzer et al M Embo. J. 
(1982) 1:139-146; Willmitzer et al., Cell (1983) 
42:1045-1056. The constructions were transferred to a 

45 binary vector (pCGN1578), and used in A Tumefaciens 
cocultivations. The resulting binary was used to transform 
tomato plants. The transgenic plants obtained were fluoro- 
metrically analyzed for Gus enzyme activity. 

Example 17 

50 

Screening Genomic Library for Polygalacturonase 
Genomic Clones 

Isolation of a Genomic Clone 

55 An EcoRI partial genomic library established in Charon 4 
constructed from DNA of a Lycopersicon esculentum culti- 
var was screened using a probe from the polygalacturonase 
CDNA (Sheeny et al., MoL Gen. Genet. (1987) 208:30-36). 
A lambda clone containing an approximately 16 kb insert 

gQ was isolated from the library, of which an internal 2207 bp 
HindUI to EcoRI was sequenced. The HindlE-EcoRI frag- 
ment includes the polygalacturonse promoter region. 
Sequence of Genomic Clone 

The DNA sequence of the genomic clone was determined 
65 by Sanger dideoxy techniques and is as shown in FIG. 8. The 
sequence of the genomic clone bases 1427 to 1748 are 
homologous to the polygalacturonase cDNA sequence. 
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The above results demonstrate the ability to identity 
inducible regulatory sequences in a plant genome, isolate the 
sequences and manipulate them. In this way, the production 
of transcription cassettes and expression cassettes can be 
produced which allow for differentiated cell production of 5 
the desired product. Thus, the phenotype of a particular plant 
part may be modified, without requiring that the regulated 
product be produced in all tissues, which may result in 
various adverse effects on the growth, health, and production 
capabilities of the plant Particularly fruit-specific transcrip- JQ 
tion initiation capability is provided for modifying the 
phenotypic properties of a variety of fruits to enhance 
properties of interest such as processing, organoleptic 
properties, storage, yield, or the like. Further, the results 
demonstrate one can use transcriptional initiation regions 
associated with the transcription of sequences in seeds in 15 
conjunction with sequences other than the normal sequence 
to produce endogenous or exogenous proteins or modulate 
the transcription of expression of*nucleic acid sequences. In 
this manner, seeds can be used to produce novel products, to 
provide for improved protein compositions, to modify the 20 
distribution of fatty acid, and the like. 

It is also evident from the above results that not only can 
soybean be transformed, so as to introduce heterologous 
genes, but transformed soybean cells may be regenerated 
into plants and the plants demonstrate the phenotype of the 25 
heterologous gene. In addition, native promoters can find 
use in conjunction with heterologous genes and retain their 
capability to be induced in the same manner as the native 
gene. Therefore, one can provide for regulated expression of 
a heterologous gene, where regulation may be by an external 30 
condition, such as light. Furthermore, Ti- or Ri-DNAmay be 
employed for introducing the heterologous gene as part of an 
expression cassette into the soybean cell without formation 
of a tumor and the resulting cells grown in culture and plants 
regenerated from the cells. By appropriate choice of various 35 
genes, various properties of the cell may be enhanced by 
introduction of additional copies of a homologous gene or 
new phenotypes may be provided by expression of heter- 
ologous genes. In addition, mutated genes may be employed 
which can impart novel properties to the host cell, providing ^ 
for host resistance to biocides, enhanced production of 
specific metabolites or products at the same or different 
times from the normal regulated expressions, or the like. 

All publications and patent applications mentioned in this 
specification are indicative of the level of skill of those 45 
skilled in the art to which this invention pertains. All 
publications and patent applications are herein incorporated 
by reference to the same extent as if each individual publi- 
cation or patent application was specifically and individually 
indicated to be incorporated by reference. 50 

Although the foregoing invention has been described in 
some detail by way of illustration and example for purposes 
of clarity of understanding, it will be obvious that certain 
changes and modifications may be practiced within the 
scope of the appended claims. 55 

What is claimed is: 

1. A method for obtaining a plant having a regulatable 
phenotype, said method comprising; 

transforming a host plant cell with a DNA construct under 
genomic integration conditions, wherein said construct 60 
comprises as operably linked components in the direc- 
tion of transcription, a promoter region obtainable from 
a gene, wherein transcription of said gene is preferen- 
tially regulated in a plant fruit tissue a DNA sequence 
of interest other than the native coding sequence of said 65 
gene, and a transcription termination region, wherein 
said components are functional in a plant cell, 



